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ON THE SMALL FORCED VIBRATIONS OF SYSTEMS 
WITH ONE DEGREE OF FREEDOM 


By Maxime Bocuer 


THE ordinary treatment* of the problem of the small forced vibrations of 
a system with one degree of freedom produced by a periodic impressed force 
is open to two objections. On the one hand, the solution is obtained in the 


form of a triganamateia anat-~ 0 --- ¢ sme low], a form 

alysis of the 

but poorly 
: ’ ator to follow 
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levelopment. 
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unnecessary. 
4 In the present paper T wish to show how the whole subject may be developed 


in a manner at once rigorous and extremely general. This treatment, which 
is thoroughly elementary, follows, in the deduction of Radakovié’s formulae, 


ca 
; a method which, while not essentially different from that of Radakovié , will, 
' it is thought, be found more perspicuous. I have also considered various 
7 
matters which are passed over without mention by Radakovic , for instance the 
“ case in which the damping is so great that the free motion would not be 
ey : ; 
vibratory. 
i 
* See for instance Lord Rayleigh’s Theory af Sound, vol. 1, § 47. 
t Wiener Sitzungsherichte. Math. Naturwiss Hlasse, vol. 114, ma (1905), p. 877. 
‘4 ~ See Horn, Gewihnliche Differentialgleichungen (Sammlung Schubert), §23. 
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ON THE SMALL FORCED VIBRATIONS OF SYSTEMS 
WITH ONE DEGREE OF FREEDOM 


By Maxime BOcHER 


THE ordinary treatment* of the problem of the small forced vibrations of 
a system with one degree of freedom produced by a periodic impressed force 
is open to two objections. On the one hand, the solution is obtained in the 
form of a trigonometric series [see formulae () and (13) below], a form 
admirably adapted to certain acoustical questions, where the analysis of the 
vibration into its simple harmonic components is required, but poorly 
suited to the more fundamental purpose of enabling the investigator to follow 
the vibrating system in the details of its motion, since it is in general not 
easy to infer the nature of a function from its trigonometric development. 
The second objection refers to the correctness of the process by which the 
solution is obtained, as it is by no means clear that the trigonometric series 
found really represents a solution of the equation of motion. 

The first of these objections was met by Radakovié t who gave formulae 
for expressing the forced vibrations in closed form [ef. (5) and (12) below]. 
The second objection may also be met by imposing suitable restrictions on the 
function f(2) which determines the impressed force.~ It turns out, however, 
that for the correctness of the final result these restrictions are unnecessary. 
In the present paper I wish to show how the whole subject may be developed 
in a manner at once rigorous and extremely general. This treatment, which 
is thoroughly elementary, follows, in the deduction of Radakovié’s formulae, 
a method which, while not essentially different from that of Radakovie , will, 
it is thought, be found more perspicuous. I have also considered various 
matters which are passed over without mention by Radakovic , for instance the 
case in which the damping is so great that the free motion would not be 
vibratory. 


*See for instance Lord Rayleigh’s Theory of Sound, vol. 1, § 47. 
t Wiener Sitzungsberichte. Math. Naturwiss Klasse, vol. 114, ma (1905), p. 877. 
t See Horn, Gewihnliche Differentialgleichungen (Sammlung Schubert), §23. 
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The problem of the forced vibration of systems with one degree of freedom 
when the impressed force is periodic is equivalent to the problem of solving 
the differential equation 


dy dy . 
= «2% — + = x 
(1) dy ” dx + By =f(*), 


or, more particularly, of finding the periodic solutions of this equation, where 
a is a constant positive or zero which measures the amount of damping, 8 a 
positive constant, and f(x) a real periodic function which we will assume to 
be finite for all real values of » and to have at most a finite number of discon- 
tinuities in any finite interval. 

We begin by considering the case where 


B>a*>0, 


so that the damping does not vanish and is not very great.* The general 
solution of the reduced equation (that is the equation obtained by replacing 
the second member of (1) by zero) is 


(3) ke-* cos VB — a? x + khye-** sin V8 — a? x. 
For the sake of brevity we write 
s= We] =e 2 


By the formula derived from the method of variation of constants (see for 
instance Forsyth’s Treatise, §66) we may write the general solution of (1) 
in the form 


(4) b(c) = hye cos ax + hye sin sx 


] i; 
+ ii Slee" 7-2 sin x(x — &) dé. 


We wish now, if possible, to determine the constants of integration, hy 
and /,, in such a way that $(2) is periodic. It is clear that no solution of 
(1) can have a period which is not also a period of f(r). Let us then denote 

* This is the case in which the free motion is vibratory, cf. formula (3) below. If 8 < a? 
we have for free motion the case of no oscillation illustrated by the dead-heat galvanometer. 
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by 2p any period of f(x) and inquire whether k, and k, can be so determined 
that ¢(x) also has the period 2p, i. e., so that 

hye~** +2) cos s(x + 2p) — hye~* cos sx 

+ kye~** +P) sin s(x + 2p) — hye ** sin sx 


“2p 


é : SJ (€je~ 2% +? —© sin s(x + 2p — &)dE 
0 
1 fete , 
* -[ S (E)em 8 + PO sin 8(x + 2p — E)dé 


8 2p 
— 5 [Ferre -# sin s(e — Bag = 0. 


The last two integrals here cancel on account of the periodicity of /(&), and 
the equation may then be written in the form 


COS 8x axe — e—*?* cos 2ps) — hy e~*?* sin 2ps 
Fie’ , 
— 5 [Fee r2e-® sin (2p — Bae 
+ sin sx {A e—*P* sin 2ps + h,(1 — e~*”* cos 2ps) 


1 (*? 
*- ST (Ee? —®© cos 8(2p — eae | oa &, 


A necessary and sufficient condition that this equation be fulfilled for all real 
values of x is that the coeflicients of cos sx and sin sx vanish. This gives us, 
for determining 4, and i,, two linear equations whose determinant is 


A=(l- e— pa)? + 4e—2pe sin? ps. 


Since, under the conditions we have imposed, this quantity cannot vanish, 
being the sum of two real squares of which the first is not zero, we see that 
there is one and only one determination of 4, and kj, and we readily find 


a Sees ~© [sin s(2 p — &) + e~??* sin s€ dé, 





ky — 


8 


lam = [r@ccere [cos 8(2p — £) — e~2?* cos s€]dE. 
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Substituting these values in (4), we find Radakovié’s formula 


~~ 


1 ” 7 2 rs . P 
(5) d(x) = = | T(E)? r+2P—! sin s(x + 2p — E)dE 
om Je 


ap . 
ne | T(Ejew se + —# sin s(x — E)dg 
si. 0 


2 [ syne sin a(x — §)dé. 
0 


e 


We have thus proved 

Tueorem 1. Uf the impressed force has the period 2p and the damping 
is not zero but is sufficiently weak so that condition (2) is fulfilled, there exists 
one and only One Forced vibration of period 2), and this vibration is give n 
by (5). 

We can now see that there exists only one periodic forced vibration of any 
kind. For, if possible, let $,(.c) and $,(2) represent two different periodic 
forced vibrations with periods 2 p, and 2p, respectively. Let 2p denote the least 
common multiple of 2», and2p,. Then both ¢, and ¢, have 2p as a period. 
This must therefore, as we have seen, be a period of f(2), and therefore, by 
Theorem 1, there can be only one forced vibration with this period, so that 
our assumption has led us to a contradiction. We now easily infer 

THeorem 2. Under the conditions of Theorem 1, there exists one and 
only one periodic forced vibration. If the impressed force is a constant, we 
have the degenerate case in which the function $(x2) which determines the 
vibration isa constant. Apart from this case, the periodic forced vibration 
has the same primitive period as the impressed force. 

Let us now consider the case in which 
(6) B«<a@ 
but where a and 8 are still supposed to be both positive. 

We will show that the analysis we have just used applies to this case also. 
It is true that in this case s = ¥8 — a? is no longer real, and the trigonometric 
functions must be replaced by hyperbolic functions throughout. In particular 
we get 

A = (1 — e—**)? — 4e—2P¢ sinh? p¥a? — B, 
and the reasoning by which we showed that A cannot vanish is no longer 
valid. The fact, however, is still true and may be established as follows. 
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We may write 
A = (1 — e~** — 2e—P* sinh p¥a? — B)(1 — e—2”* + 2e—” sinh P¥a? — B) 
= 4e—*?*(sinh pa — sinh pV¥a? — 8) (sinh pa + sinh py@ — B). 


Since none of these factors vanishes, A + 0. 

The remaining work which leads up to formula (5) requires no change, 
and formula (5) remains correct. If we wish, we may replace the imaginary 
quantity s in this formula by the real quantity Va? — B, at the same time re- 
placing sin by sinh. We thus get 

Tueorem 3. Theorems 1 and 2 remain true if condition (2) is replaced 
by condition (6). 

There remains the case 8 = a*. This case may be discussed along the 
same lines as the other two cases if we replace formula (3) by 


k, we-** + ky e-*, 


Since no special difficulty and no special point of interest presents itself here, 
we content ourselves with merely stating the result, that Theorems 1 and 2 
remain unchanged here too, provided in formula (5) the fractions 


sin s(x + 2p — &) sin s(x — &) 
Se gs ©”+~—r- 8 





are replaced by « + 2p — & and « — € respectively. 

We have thus carried through the discussion of all cases where there is 
damping.* The case a = 0 presents some peculiarities to which we shall come 
back presently. We wish, however, first to consider the question of develop- 
ing the function ¢(.c), which determines the periodic forced vibration, in a 
Fourier’s series 





cm ; 
a $(x) = 3 * >> («1 cos —— + 5; sin mr) ; 
“ i=] Pp P 


That this development converges absolutely and uniformly for all values of x 
to the value $(.c) follows from the fact that ¢(x) is not merely itself contin- 





*In fact our discussion also covers the cases, which do not present themselves in the 
physical problem with which we are concerned, in which a and § may have any real values 
positive or negative, barring as yet the cases in which a or 8 vanish. 
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uous but has a continuous derivative.* The coefficients a; and 4,, or as we will 
‘all them, following a suggestion of Hurwitz, the Fourier’s constants of $(2), 
might be determined by making use of the expression (5) for $(x). They 
may be determined much more readily as follows by the method of unde- 
termined coetticients. r 

By differentiating the series (7) term by term first once and then twice, 
we get new trigonometric series which we will call (7') and (7"). Although 
we shall not attempt to make any statement as to whether these series rep- 
resent the functions ¢'(x2) and $"(x), or even as to whether they converge, we 
do know that their coefficients are the Fourier’s constants of ¢'(x) and $"(2) 
respectively.t If, then, we multiply the terms of (7) by 8, those of (7') by 
2a and add the two series thus obtained to (7"), we shall get a series whose 
coefficients are the Fourier’s constants of $” + 2a¢' + 8¢, that is of f(.r).t 
If we denote the Fourier’s constants of (7) by y;, 6;, so that 


} is. imk | 
8 "= | F(€) cos — dé, i= (€) sin — dé, 
(8) -. Free a, Oe Se 


we are thus led to the equations 
% = ah, + Ba,, 
T 
aa; + Bb,. 


[hese are linear equations for the determination of a, and 4, in terms of the 
known quantities y;, 6;. The determinant of these equations is 


i 


l 
D, = (p*B — Prr*)? + 40 p*r*a" ‘ 


p | 


*Cf. for instance the writer's Jntroduction to the Theory of Fourier’s Series, these 
ANNALS, Vol. 7 (1906), p. 109, Theorem IL; also published separately by Harvard University. 

+ This follows from the fact that ¢ and ¢! are continuous, while @” is finite for all values 
of ¢ and has only a finite number of discontinuities in any finite interval. Cf. the paper last 
referred to, p. 116, Theorem I. 

+ Cf. p. 87, Theorems V, VI, of the paper last cited. 



























ON SMALL FORCED VIBRATIONS 7 





1908] 


This quantity does not vanish, since a ¢ 0, 8 # 0. Thus we get for a; and 
b, the values 


a;= an | (P*8 — imr*)y; — 2ipmad, | ; 


b, = —— {27 i 2B — inr?)d }. 
‘i =p, pray, + (ph — swt)s 
Substituting these values in the series (7), we get the Fourier’s development 
of (x). This development may, by a simple trigonometric reduction, be 
thrown into the form 


9 Lj = Yo . Yi 8S id — v;)- 8g al “Tm; l 
(9) $(@) = 98 + p> VDP — VD, p | ia | 


i=l 
where the constants .; are to be determined by the equations 


, ‘ — , na 
(10) cos iT %; = es, sin nana = — 
~p PVD; P Pv D; 
Thus we have established 
Tueorem 4. The function which represents the periodic forced vibration 
when the damping does not vanish can always he developed into the absolutely 
and uniformly convergent Fourier’s series (9) where y;, 6; are the Fourier’s 
constants (8) of the function f(x) which determines the periodic impressed 
force. This last function may be any finite function with the period 2 p which 
tnevery finite interval has at mosta finite number of discontinuities, and need 


not be developable into a convergent Fourier’s series.* 


The case of an undamped forced vibration (a = () still remains to be 
considered. Here the determinants A and D, become respectively 


—— 28 — f2er2)? 
(11) A = di sin® pyB, D, = FE a 
{ 
In order to insure that these determinants be different from zero, we must 
assume that pV8 is not an integral multiple of 7. In other words, we 


* This result also, and its proof, applies without change when a, or 8, or both are 


negative. 





ae Se a 


— 


— 


SS 


= a 
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assume that the period 2p of the impressed force is not an integral multiple of 
the period Ir /VB of the Sree vibration. Under this assumption our analysis, 
so far as it refers to Theorems 1 and 3, requires no modification. Formula (5) 
now easily reduces to 


~) 
- 





(2) g(x) = " £(&) cos VB (x + p — Ede 


2V8 sin ~EL 
D> va : 
| an T(&) sin VB (x — &)d€, 


a formula due to Radakovic. 
The Fourier’s development of ¢(2) also simplifies, for we see from (10) 
that in this case all the quantities 2; vanish, and the series for }(2) becomes 


i Yo yy { Pri wr ps, . 7x) 
1: = + < - ‘OS rt S . 
— o(7) 28 dma (p28 — ra? a p p?B — ex ” ph 


We thus obtain from Theorems 1 and 3 

THEOREM 5. [f there ix no damping and the impressed force hasa period 
2p which is not an integral multiple of the primitive period 27 VB of the Sree 
vibration, there exists one and only one forced vibration of period 2p,and this 
is qiven hy formula (12) and also hy the series ( 15). 


Theorem 2 obviously does not admit of extension to this case except in the 


special case in which the periods of the impressed force and of the free vibra- 
tion are incommensurable. 

In the special case in which 2p is an integral multiple of 27 93 itwill be 
readily seen by a reference to the linear equations for determining 4, and ky, 
that equation (1) has no solutions with the period 2p unless 


] 


2p ; “2p 
[ FE) cos VBEdE = 0, | S(E) sin VBETE= 0; 


but that if these conditions are fulfilled, every solution of (1) has the 
period 2p. 


Harvarp UNIversity, 
CAMBRIDGE, Mass., 
Jury, 1908. 




















TWO PRINCIPLES OF MAP-MAKING 
By J. K. WHItTTEMORE 


One of the most interesting applications of mathematics is to the problem 
of making a map of a part of a given surface, practically the most important 
surface being that of the earth considered as a sphere or as an oblate spheroid. 
The problem is essentially to establish a correspondence between points of 
the surface and points of a plane map, so that figures on the map shall 
resemble as closely as possible the corresponding figures of the surface. Since 
it is possible only for developable surfaces to construct a map on a constant 
scale ,* that is such that all lengths on the surface are represented by propor- 
tional lengths on the map, it has been generally admitted by mathematicians, 
and to a considerable extent by geographers, since the publications on this 
subject of Lagrange and Gauss,t that the best maps are those which preserve 
the shapes of small figures, and in which, consequently, angles are unchanged. 
Such maps are called conformal. 

Let us first consider the more general problem of mapping a surface S 
on any other surface S,. The equations of the two surfaces may be written 


S : x =f(u, v), y= (4, v), z= (u,v); 

MS: = f(r), nh = O(U, v), z= W(u, v). 
We may without loss of generality } suppose corresponding points of the two 
surfaces to be those obtained by giving to ~ and v the same values in the two 


sets of equations. If we denote by ds and ds, the “linear elements” on SS and 
S;, we have 


= a 2 
ds? = dx? a di + dz => de => (= du + = dv) 


Edi? + 2Fdudv + Gar, 





*See p. 14, below. 

t Lagrange, Sur /a construction des cartes géographiques. ‘Two memoirs presented in 1779 
to the Academy of Berlin. Ovurres, vol. 4, p. 637. 

Gauss, Allgemeine Aufldsung der Aufgabe die Theile einer gegebenen Flache auf einer ande- 
ren gegebenen Fliche so abzubilden, dass die Abhildung dem Abgebildeten in den kleinsten Theilen 
dhnlich wird; presented in 1822 to the Royal Society of Sciences of Copenhagen, as an answer 
to a prize question. Werke, vol. 4, p. 159. 

¢Gauss, loc. cit., p. 194- 


(9) 
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Cx 2 Cy\? ez\? cxr\? 
=) +(5") + =) = a) ’ 
cu Cu cu 3 Cu 
abet = (ae): 
hod CUE CH fot \CU 
Similarly, ds, = E, du? + 2F, du dv + G, dr’, 


r, 2 A ae a 

ROY £22) a 7 wal 

where £ => . P tet ——. Cin oe 3 
; cu . fed CH CV : wont \ CP 


It may be proved that it is both necessary and suflicient® for a conformal 
correspondence that, for all values of » and v 


BE - | = (y 
a” 7, * 3, 


If, in particular, E = Gand F = 0, so that 
(1) ds? = E(du? + dr*), 


and if we choose as S, the plane 7, = v, ¥, = ", 2) = 0, so that dsj = du? + dr?, 
we shall have, by this correspondence, a conformal map of S on the plane S;. 
It may be proved? that the linear element of any surface may by a suitable 
choice of the parameters, «and v, be thrown into the form (1) in an infinite 
number of ways, and further, if one such form is known, all others are obtained 
by writing 


u's iv’ = f(u + wv), 


where f(u + 7v) is any function of the complex variable u + 7v holomorphic 
in the region covered by the map. 

Let us consider as examples the conformal mapping of a sphere of radius 
one and of an oblate spheroid obtained by rotating on its minor axis an ellipse 
of eccentricity ¢ and semi-major axis one. If the center of the sphere be at 
the origin, and if + be the longitude measured from the «z plane and £ the 


* Gauss, loc. cit., p. 195. Goursat, Cours @analyse mathématique, vol. 2, p. 46. 
+ Gauss, loc. cit., p. 16. Darboux, Théorie des surfaces, vol, l, p. 145. 
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complement of the latitude of a point of the sphere, we have 


x=sin Becosv, y=sinBsinv, z=cos BP; 


ds? = d8* + sin? Bdv? = sin? (<F + dv?) : 
sin?g 








8 dp B 
L t “= —_—s —e 
. . | 28ing log tan 2 
Then we have 
l ; : Qu 
tan— = e", cos* — = ——_,, sin? = - E ? 
: 2 1 + e*™ 2 1 + e* 


; — 4" 2 2 
sin?8 = 4sin* = coe?” = ; = ( ) ; 


» =z 74 emu 3 eu + eu 
so that, finally 


» 2 
ds? =( — :) (du? + dv?). 
+ ¢ 


The point of longitude v, latitude 90° — 8, is now represented by the point of 
coordinates (uv, v) in the plane S, and the represention is a conformal map 
which may be used to represent any part of the sphere not including either 
pole. The map so constructed is the famous “Mercator’s projection” in which 
circles of latitude and longitude are represented by sets of parallel straight 
lines, and in which a rhumb line, crossing all parallels of latitude at the same 
angle, is also represented by a straight line. 


If we write 
ul + te’ = ett,” 
then is 
u+ iv = log (wv + iv’) = log p + 10, 


where p and @ are the absolute value and angle of the complex number 
u'+ iv’. We have 
u = log p, v= 8, 


‘ 2 2 . 
ds* =( : :) (du? + dv*) =(4¢ i) (dp* + p?d@). 


wae 


In the wu’, v’ plane, in which p and @ are polar coordinates, circles of latitude 
of the sphere (8 constant) are represented by concentric circles (p constant), 





a 
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and circles of longitude (7 constant) by straight lines radiating from the 
origin (@ constant). The conformal map of the sphere so constructed is that 
obtained by stereographic polar projection. 

To construct a map of the oblate spheroid obtained by revolving about 
its minor axis the ellipse 
¥ 


we ee 
l—é 


x? a 
let us, following Lagrange,* express the coordinates of a point of the ellipse 
in terms of 8, the complement of the geographical latitude, which is the angle 
made by the normal to the surface of revolution with the plane of the equa- 
tor. One may easily see from the properties of the normal to an ellipse that 

i , 
ctn 8 = single . whence we find 


x(1 —e*) 
sing _ 1 — ¢*)cosp 


yl — &cos*8 


HW = 


7 V1 —ecoss 


« 


If now v be the longitude of a point on the spheroid we shall have for its 
linear element 


ds* = dx? a dy? + ax dv? 


sin’? (1 — e*)? 
aa ( 


ds t dv). 


aa ye sin?8(1 — e? cos?8)? 


Let us write 


- . 7 
_ (1 —&)dp oe ( (1 + ecosBy\*? B 
" =| »sinB(1 — e?cos?8) log { G on sacat) - 45 


Then we have 
sin? 


df = ——.__ 
l — e cos*8 


(du? + dv*). 


The coefficient would naturally be expressed, as in the case of the sphere, in 
terms of wu, but such an expression cannot be obtained in any simple finite 


* Lagrange, loc. cit. p. 661. Lagrange and other,writers use the letter z for the comple- 
ment of the geographical latitude. I have used §, since in this paper z is used for one of the 
coordinates. 
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form. It is clear that these formulas reduce to those for the sphere when 
e=0. Lagrange remarks that if we write 


rae : =(j + ecosB\*?, B 


‘wer | Beet £ 


we have u = log tan ¢/2, and ¢ is an angle obtained by making a correction, 
which vanishes with e, in the geographical co-latitude. We have thus a map of 
the oblate spheroid in which circles of latitude and longitude are, as in Merca- 
tor’s projection, represented by sets of parallel straight lines. The transform- 
ation considered in that case would give us on the w’v’ plane a map of the 
surface in which these curves would be represented by concentric circles and 
lines radiating from the center. 

Having seen that any surface may be mapped conformally on a plane in 
various ways, we now consider the scale of the map, on which is to depend 
the choice of the best conformal map. The scale is the ratio of corresponding 
lengths on the map and the surface at a point. Denoting the scale of the map 
by m,, we have m, = ds,/ds. Suppose a second conformal map constructed 
on the u'v’ plane, S’, where 


u' 4 te’ = f(u + ir). 
On this map the scale m is evidently ds'/ds. We have 
ds? = E(du? + dv*), ds? = du? + du’, 
ds’? = du + dv’ = E'(du*® + dv*). 


Since 
du' + ide’ = f'(u + iv)(du + tidv), 


du’ = idv' = f'(u — iv) (du — idv), 


where f is the function conjugate to f, and /’ and sy denote the derivatives of 


f and f, we have 
, ho E' 
E' = f'(u + iv) f'(u — wv), m= Vz 
We have evidently 
log m= U — $ log E, 
where 


U= 43 log BE’ = 3 log f'(u + iv) +3 log f'(u — iv) 
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and is obviously a solution of 
AUV= 


being simply the real part of the monogenic function log f’(u + tv). More- 
over, a function f may be found such that U’ is any harmonic function, 
that is, a solution of AU = 0, continuous with its partial derivatives in the 
region of the first map. 

We may now see clearly that not every surface can be mapped on a 
constant scale, for, in the first place such a map must be comformal,* so we 
shall have 

A log m= — 4A log EL, 


and if m may be constant we must have A log # = 0, and this is not the case 
in either example considered. It is interesting and, for the purpose of our 
discussion, important to tind another expression for A log m. Gauss ob- 


tainedt an expression for the total curvature of any surface in terms 
of the coeflicients Z, F, G, and their derivatives which becomes, when 
ds? = E(du* + dv), 


ra S#, 1 Pek), (2 
=~ gm * sel) +(e) J} 


But we find 


EAE — ie 2 x (5) | 
Cu cv 


Ee 





A log E = 


Hence 
Alog H=—2EH and A log m= Eh. 


Since £ is always positive, a map of a surface can have a constant scale only 
when Av = 0, that is, only when the surface is developable. 

The construction of a second conformal map on a plane S’, has been 
shown to depend on the choice of a harmonic function of u and v on the map 
in the plane S,.. The problem which we shall now consider is the most in- 
teresting question in the theory of map-making, the choice of this harmonic 


* Gauss, loc cit., p. 194. 


t Disquisitiones generales circa superyicies curvas, presented in 1827 to the Royal Academy 
of Géttingen. Werke, vol. 4, p. 232. 
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function U, so that the map on the plane S’ may be the best possible map. 
We must be guided in this choice by a consideration of the variation of the 


scale of the map. Various criteria for the best conformal map of a region of 


a surface have been proposed. Of these I shall consider fully two* which, 
very curiously, starting from totally different points of view reach, in most 
cases, the same conditions for the determination of U. The first of these 
principles was enunciated in 1856 by Tchébychef, in a paper presented to the 
Imperial Academy of Sciences of St. Petersburg,t as follows: The minimum 
deviation of an integral of AU’ = 0 from the function 


log pu + e~ uw 


in a region bounded by a closed curve cannot occur unless 


2 : 
U’ — log as 5°3 
is constant on the contour. It is not entirely obvious what Tchébychef meant 
by the “deviation,” and his statement applies only to maps of a sphere. So 
far as I can learn he never published any proof of this remarkable theorem, 
though he applied the principle to the construction of a map of Russia. I 
believe that the theorem was first proved in 1894 by D. A. Gravé, a professor 
at St. Petersburg.t This proof I shall here present in a form so generalized 
that the principle may be applied to the construction of a map of a region of 
any surface in which the total curvature is throughout of one sign, thus 
justifving the application of the principle made in the following paper by Dr. 
G. W. Hill to a map of a region of an oblate spheroid. 
Another criterion was proposed and discussed by F. Eisenlohr § in 1870. 
He regards as a measure of the accuracy of a map the distortion of the 
geodesic lines of the surface. If we find at each point of the map the max- 
imum value of the curvature of the representations of all geodesic lines through 


* Two other conditions for determining U were proposed by H Weber in a paper pub- 
lished in Crelle's Journal fiir die Mathematik, vol. 67, p. 229, and cited by Eisenlohr. These 
conditions while apparently as reasonable as those of Tehéhychef and Eisenlohr lead to no 
simple determination of U and are consequently of comparatively little interest. 

+ Bulletin, vol. 14, p. 257; Oeuvres, vol. 1, p. 233. 

¢ Association francaise pour Varancement des sciences, 23° session, Caen. 1894, p. 196. 

§ Crelle’s Journal fiir die Mathematik, vol. 72, p. 143. This paper and Weber's are cited 
by Gravé, loc. cit. 
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this point we may determine Uso that the mean value on the map of the square 
of this maximum curvature shall be a minimum, and find, at least in the case 
when the first map in the plane S, is simply connected, that this condition 
requires that the scale be constant on the contour. It is clear that Eisenlohr’s 
and Tchébychef’s conditions are the same when the part of the surface mapped 
is simply connected and has a total curvature everywhere of the same sign. 
I shall here give a proof of Eisenlohr’s principle, supplying some details 
omitted by him, and emphasizing the fact, which he does not consider, that 
the proof applies only when the map in the plane S, is a simply connected 
region. 

We now take up the proof of Tchébychef’s principle. Let the region of 
the map in the plane S, be denoted by 7, and the contour of this region be 
©. The contour ( may consist of one or more curves and is considered as 
belonging to the region 7’, which is therefore “bounded.” Let (” be a single 
valued harmonie function in 7’, that is to say continuous with all its partial 
derivatives and satisfying the equation \U’= 0. We shall need the follow- 


* 


ing theorems* concerning LC”: 

a) U can have no maximum or minimum value at an interior point of 7. 
From (7) it follows that 

b) If U is negative at all points of C’ it is negative at all points of 7. 
From this fact it may be proved that 

c) If (vanishes at some points of C and is negative at all others, it is 
negative at all interior points of 7. 


d) If (is constant on Cy it has the same constant value at all points 
of 7, and consequently 

e) If Vand 0” are two harmonic functions in 7’ equal at all points of C, 
then since (7 — (” is harmonic in 7 and also on C, U and U” are equal at 
all points of 7. 





*See e. g. a discussion of harmonic functions in Osgood’s Functionentheorie, vol, 1 
543. 

+ Theorem (¢), on which the following proof depends, is, I think, not generally stated in 
treatises on harmonic functions. Its truth may be easily seen as follows: under the conditions 
of (¢) U cannot be positive at an interior point of 7: if at an interior point a 
we should have by the mean value theorem (Osgood, loc. cit., p. 544 (-"? Uds = 0, where the 
integral is taken around the circumference of a circle of radius Pp, wholly within 7, and having 
its center at «. 


, U vanishes, 


Since Vis not positive at any point of the circle, it is zero at all points of the 
circumference, and consequently, since it is continuous, at all points of C, contrary to the 
hypothesis. , 
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Grave states without proof the following theorem: If $(w) is, with its 
first and second derivatives, $'(u) and $’(~), continuous and single valued in 
T, and if @$”(u) is negative at all points of 7, and if also, on C, i | 
U— $ (u) = 0, where U'is harmonic in 7, then J — $(u) is negative at all 
interior points of 7’. Let us prove the more general theorem: 

If $(u, v) is with its first and second partial derivatives continuous and 
single valued in 7’, and if Ad is negative at all points of 7, and if also, on 
C, U — $(u, v) = 0, where U' is harmonic in 7, then U — $(u, v) is negative ti 
at all interior points of 7’. | 

Let V= U— (u,v). Then, at all points of 7, 


Se 


a 


AV =— Ad > 0. 


Since V is zero on C, if it is positive at any interior point of 7’, there is 
within 7’ a maximum value of Vo where necessarily 


9 r 


™ 
“> 


9 r 
- 


tare 


< 0, — < Q, 
cu 


| 


) 


” 


ac 
wate tia = 
a 


These conditions are inconsistent with the fact that A Vis positive at all points 
of 7. If at any interior point V vanishes, since no displacement to a neigh- 
boring interior point can cause V to increase, we must have at this point 


oV aV 
~ o—_ ~ =o 0, 
cu cr 
and then for the same reason i 
ila ner 
ey et 
_—~“s = 0 —_—s = 0, 
cu? or 


conditions again impossible. 
To apply these theorems to the proof of Tchébychef’s principle, let 


1” = logm = U — 4 log BE. 
Then is 
o(u, v) = $ log E, Ad = — EX. 


Hence, if the region of the surface to be mapped is without singularities and 
of positive curvature throughout, the conditions of this theorem are satisfied. 
This is indeed the case with any region of the spheroid. 
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Suppose now that U has been found so that, on C, Vis constant. To 
suppose this constant zero will affect only the size of the map, and be no 
essential restriction. Then suppose, on C, 


= log mw — {7 — 4 log E=0. 


By the theorem just proved V is negative at all interior points of 7. Let D 
represent the “maximum maximorum” of the absolute value of Vat all points 
of T. Then Dis the deviation of the map in the w'v’ plane, S’. The de- 
viation in Tehébychef’s principle is the difference between the greatest and 
least values of the logarithm of the scale, or the logarithm of the ratio of the 
greatest value of the scale to its least value. We may now prove that the 
deviation of Vis less than that of 14), where 


i, = U, —$ log EZ, 


and Uj, is any harmonic function in 7 not differing by a constant from U. 

Let the maximum value of }; on C bea. Then 1, —a has the same 
deviation as J), and this difference is at some point of C zero, at some points 
certainly negative, since (7, — U is not constant, but at no point of C’ positive. 
We may write 


V,-a=V+0,-a-U, 


and since V vanishes at all points of C, the harmonic funetion (0 — a — U is 
zero and negative but not positive on (. Then 7, — a — €° must be negative 
at all interior points of 7’, and since the same is true of Vy the sum, Vy) — a, 
which vanishes at some point of C, has within 7’a negative value numerically 
greater than V. Then J, — 7, and consequently J), have a greater deviation 
than V. 

We may note that a map of a region of positive curvature constructed on 
this principle has its greatest scale on the boundary. A similar discussion 
shows that a map of a region of negative total curvature at each point, con- 
structed on this principle, has a minimum deviation, and has its least scale on 
the boundary. 

We now take up the proof of Eisenlohr’s theorem. We make our point 
of departure two theorems concerning geodesic curvature : 

1. The geodesic curvature of a geodesic line of a surface is at every 
point zero. 
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2. If the surface is plane the geodesic curvature of any curve on the 
surface is the ordinary curvature. It may be proved that, when the linear 
element of a surface is given by ds? = E(du* + dv*), the geodesic curvature, 
1/p,, of the curve on the surface whose equation is ¢(u, v) = 0 is given by* 


ve ve 
cv 


cu 


1 1 | ¢ 


Pe (“" v (5) + (#) "? (3) + Gy 


Suppose a map constructed in the plane S’, whose linear element is given by 
ds"? = E' (du? + dv*), where as in the previous discussion 





m » 


ut ivi = flat iv) and £"= f'(u + iv) f'(u— ir). 
Let us introduce a variable, 7, defined by the equations 


a4 ch 
ev 


C v 





= COs T, = sin Tf. 


ae \@)-@ 


Then we may write for a geodesic line of the surface, whose equation is 
$(u, v) = VU, and for the curvature, 1/p, of its representation on the map S’, 
which has the same equation, 


= pln VE cos T) + = ° (WE sin *)t, 

{ Gu j 
1 1 ( 0 Ty ss ra) i: ) 
a r —_— E" ae (VE cos T) + ay (VE sin 7) ‘. 


If we perform the indicated ditferentiations, we may eliminate the derivatives 
of 7, and obtain the result 


E 1 ave’ 1 OWE. 
f —= Ov ET BJ im T+ is . : )sint . 
- “e (\VE" Gu VE du VE" ov VE ev 

* For a of this formula and the statements concerning geodesic curvature see Bianchi- 
Lukat, Differentialgeometrie, chap. V1. 
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The maximum value at a point (uv, v) of 1/p*, found by varying 7, is easily 
seen to be 
( 1 oy hk’ 1 ¢\y =) 
NV EE" ov VE tv J’ 


which we will write 1/r°, and which may be transformed into the expression 


ml (5 eu ma =)* (5. \ =) } j 


We have now the scale of the map given by 


E 
, and consequently 55 = 


kL 


so that we may write 


— 1 ¢ fem cm ) - a ( c log” m (s log m 
re Bnd | (ai) + (3; Di w ( a 


‘The harmonic function lis to be chosen so that the mean value of 1/r* on the 
map shall be a minimum. We must then make a minimum the integral over 
the map 


ie du! dv’ -{f74 fate ad -/f} (F log gay e log am) Ret 


Now we have log m = U — 4 log E, and £ is a given function of u and v so 
that any variation of log m is a variation of U’alone and consequently itself a 
harmonic function. Let any such variation be represented by g. Then a 
necessary condition that the integral be a minimum is the vanishing of the 
first variation of the integral so that 


Cc log me OF é log im OF 
/[E . x + ——gtwn of hie de ax @, 
ou ov ov 
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This integral may, by Green’s theorem,* be written 


: ; 
Joe m a - ds -[ [roe m-Aq-du-dv=0, 


where x denotes the exterior normal and the simple integral is taken around 
the boundary of the map in the uv plane, S;. Now since g is a harmonic 
function, Ag vanishes, and the condition becomes 


J toe m 5 - ds = 0. 


Cn 


We know that for any harmonic function the integral 


3, 
— ds = 0, 
on 


and from this and the condition found Eisenlohr infers immediately that log m 
must be constant on the boundary. This conclusion does not appear to 
me wholly obvious, but may be reached, when the map is simply connected, as 
follows: Let the harmonic function conjugate to g be p. Then we know 


oq Op 
cn so Gs 


and 
0g j op 
[ve m _* ds =| log m * ds 


_ [fe log m) _ “ é log “has. 


{ cs 8 
If the map on the plane S, is simply connected, p is single valuedt and 


[oe oe”) ds = 0. 
, Os 





* See, for example, Picard, Traité d’analyse, vol. 2, p. 10. 
¢ Osgood, Functionentheorie, p. 546. 
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In that case the condition becomes 


~ 6 log m 
| p = ds = 0. 


Os 


Again if the region is simply connected a harmonic function p may be found 


* 


taking any continuous set of boundary values,* and the variation q chosen as 
the single valued function conjugate to p, and consequently the condition 


requires that at every point of the boundary 


and m must be constant on the boundary. It is to be noticed that this con- 
dition is not proved sutticient to make the integral a minimum, and that 
consequently the principle of Eisenlohr is less complete than that of 
Tchebychef. 


\ 


* By the solution of *‘Dirichlet’s problem.” See Picard, loc. cit., vol. 2, p. 38. We must 
require further that the boundary consist of a finite number of ‘‘regular” arcs. 


HARVARD UNIVERSITY, 
CAMBRIDGE, Mass., 
JUNE, 1908. 








APPLICATION OF TCHEBYCHEF’S PRINCIPLE IN THE 
PROJECTION OF MAPS* 


By G. W. Hier 


THE question how to obtain the best map of a portion of the surface of 
a sphere or ellipsoid of revolution is certainly an important one. Tchébychef 
was, I believe, the tirst mathematician to consider this matter from a general 
point of view. His dictum is “A map is the best when, within its limits, the 
logarithm of the scale has the least possible oscillation.” It is here assumed 
that this function of the scale is a measure of its own importance. It may be 
objected that the function enjoying this property is not the logarithm. This, 
however, has no more validity than the objections made against the method of 
least squares in the treatment of discordant observations. The author evi- 
dently was induced to adopt his principle by the elegant transformation it is 
capable of. 

In the projection of the oblate ellipsoid of revolution on a plane, let x 
and ¥ denote the rectangular coordinates of the projected point, u the longitude 
and z the geographical colatitude of the point to be projected. Let ¢ denote 
the eccentricity of the meridian of the ellipsoid. ‘To avoid the consideration 
of a linear unit we assume that the ellipsoid has the semi-axis 1, and that the 
scale on the outer contour of the map is also 1. The multiplications to be 
made for other conditions are so obvious that they need no mention. As we 
are not restricted to the use of z and u for defining the position of a point on 
the ellipsoid, and as the relative properties of this surface and the plane, in 
respect to projection, demand another variable in the place of z, let us put 


] + €cos2\‘ 2 
v = log} | ———_}? tan 5], 
1 — ecosz 2 


where the logarithm is hyperbolic. While z goes from 0 to 7, 7 goes from 
—x to+ x2. Then, according to the theory of con forme projection, the 


connection between the variables “, y and «, vis shown by the equation 


*Sce the preceding paper. 


(23) 
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x+y —l=f(u+rv—1), 


f denoting any arbitrary function. The scale m is given by the equation 





Vl — & cos*z 





+ 4log f'(u + ry —1)4+ 4 log f'(4 — rv — 1), 


log m = log 





sin 2 





where /f’ denotes the derivative function of f.* For brevity let log m be iy 
denoted by Vand its first term by A, and the two following terms involving 






the arbitrary function by W, so that 
r= A+ W. 







In establishing his principle Tehébyehef compares it to the statical case 





of the convection of heat. But we may, with advantage, substitute for the 





latter the general principle of hydrostatics. 
Let 1, denote the greatest value assumed by Vin the portion of the 


















surface to be projected, and Vy the least. Then, in order that the map may 





be the best possible, VW) — Vj, must be a minimum. This is expressed in the 
language of the calculus of variations by 6( 1, — V,) = 0, the variation having 





reference to the form of the function f#. (The eriterion for discriminating a 


minimum from a maximum need not be noticed). i 





Now, to form the cognate problem in hydrostatics, let us suppose that, in 
the plane of ”, 7, the surface to be projected is covered with a fluid of infini- 
tesimal depth, such that the gravitational action of itself on itself has a po- 
tential 1. The fluid can be in equilibrium if Vis constant along the boundary 
of the surface. In case the boundary consists of several detached portions, 
the theorem still holds, but the constants belonging to each portion may be 
different. Appealing to the principle of least action for the condition of 
equilibrium, if J} is the maximum of the potential and V4 the minimum, just 
as before, 6( 1, — V,) = 9, the 6 having reference to the distribution of the 
density p of the fluid. Now Poisson's equation is 
‘'y 1” 

+ + top = 0). 


” s , 
ue cr 


ay ™ 
™ 
we 


* For demonstration of these equations (see the preceding paper) consult Gauss, Werke, 
vol. 4, p 193. The essential parts of Gauss’ investigation have been translated by Thomas Craig 
in his Treatise on Projections (separate publication of the U.S. Coast and Geodetical Survey, 
Washington, 1882). Atthe end of the supplementary volume of Boole’s Differential Equations 
will be found a brief treatment of the matter. 
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But, in the first problem, we have 


eV @V CA @A 


— Ss Se Tt ss 
ou? ov? Cu? Ov? 
Hence, if we make 
CA a oA 
47 p = —{ — ——— 
p ou? cv? J’ 


the two problems are rendered identical. 

Thus the transformation of Tchébychef’s dictum is “That map is best in 
which the scale is constant along its boundary ; it being understood that when 
the latter consists of disconnected portions the constant may vary from one 
portion to another.” 

The Techébychef principle demands that, with our conventions, V should 
vanish on the contour of the map. It has become the fashion recently to 
allow meridians and parallels to form this contour. Consequently it is pro- 
posed to treat here the simple case where the portion of the ellipsoidal surface 
to be projected is contained between portions of two parallels and two merid- 
ians. By counting the longitudes from the middle meridian, we can suppose 
that + wv are the limiting values of u; also let vp and », be the lower and upper 
limits of v; and 2, z, the corresponding values of z. The problem, thus 
limited, has the advantage that the contour is regarded as dependent on wu, v 
instead of x, ¥. 

A is stated in terms of z instead of v; the first should be replaced by the 
second ; but the equation connecting them is eminently transcendental, the 
infinite series in powers of & is complicated. Limiting ourselves to the first 
power, 


1 — €& cos?z 
= log cosh r + & tanh v(1 — 4 tanh v). 


log » ; 
sin2 ° 

It is readily seen that our problem is a case of what is known as the 
Dirichlet problem. In the mode of solving this, started by Fourier, it is 


necessary to suppose that 


i=@ 
1 — @cos*?z —~ -  , ,w(v— Up) 
her shi tee ES hei, 
. sin z fag v, — Uo 


A, B, N; being constants. A and B are evidently determined by the 
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equations 
At Banks © ee, A+ Be, = log SON », 
The expression 
We= 4 log f(u tev —1)+4 log f' (4 — ry yim |" 
satisfies the differential equation 
Cw 


cu 


A particular integral of this is W= ("V, where ("is a function of wu alone, 
and JV of » alone. By the substitution of this the differential equation 


becomes 


~~ = FeV, 
au? 
where ¢ is aconstant. Thus both (” and V are periodie functions, one to a 
real, the other to an imaginary period. Which is to have the real, and which 
the imaginary period, depends on the cireumstances of the problem. Here 
U is to have the imaginary and J’ the real period. Thus the particular in- 
tegral may be written 


Wo = VW cosh(cu + i) sin(er 4+ h), 


where V, k, Aare the arbitrary constants. As the equation is linear, a more 
general integral may be written as 


cosh(cu + kh) sin(er + h)), 


where the summation extends to all permissible values for c. But, in this case, 
W must have the same value when the sign of ~ is reversed, thus / = 0; 


also W must vanish when 7 = 7, or ,; this demands that ¢ must be an integ- 
ral multiple of 7/(v, — ,). Also it is lawful to add — A — Br to the expres- 
sion for W. All conditions will be fulfilled if we write 
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W=-—- A— Bu+ > M; cosh(écu)sin[ée(v — v9) ]. 
isl 
Here we have c = 7/(v, — vy), and A and B have the constant values pre- 
viously attributed to them. 
Thus the proper form for log m is 


i= 
a oe , _ 
log m = > {-¥,sin ic(v — vy) + M; cosh (éew) sin ic(v — vo) f 
{i=l 


Thus log m clearly reduces to 0 when v = v, or v,; and it will reduce to 0 when 
u = Up, provided that ./; is determined by the equation 


M,; cosh(icug) = — N;. 


Hence 


i=a 


log => [: a Seite | N; sin[ie(v — v)]. 


— cosh (teu) 
i= 
By equating the two expressions for W we have 


slog f’'(u + v¥ — 1) + 4 log f(a — vy — 1) 


im ie, ee 
= 


M, cosh(éev) sin [ie (v — vy) }. 
In attempting to derive the form of the function / from this equation some 
difficulties have been met with, and a method, suggested by a graphical treat- 
ment, has been substituted. The exposition of this is better understood in 
the elaboration of an example. 

For this let us assume 2, = 40°, 2, = 50°, uy = 5°. The elements of the 
Clark spheroid give logye = 8.916 3655 961. As showing the connection 


between the variables z and v we employ the equation 
> == . 2 eggs ~ Oe » 1 6 Da Se 
v = log tan 5 + &cosz + } et costz + § & cos’: + - ‘ 


Thence it results that 
Vy = — 1.00547 07238,  v, = — 0.75853 60483, logic = 1.10456 77942, 
= 12.72236 3332, ¢ x 1° = 0.22204 71288. 
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If s be an argument which goes from 0 to 7 when v goes from rp to vr, we have 
the following table of correspondence between v and 2: 


eae! 4a Ditference 


log ; 

? Sin - 
U0 — 1.00547 07238 40° 0.43994 0702 0.43994 0702 O 

w 98489 2834240 4: f 725% 42426 5941 42537 1276 —0.00110 5335 

23 96431 49446 41 32 39 .20) 40877 22064 $1080 1851 202 9587 

94375 7054942 20 4 62 OYB40 2 39623 2425 276 T7026 

92315 91653 43 O19 37TS83-. 38166 3000 331 1808 

9QO258 1275743 57 135 BO34B SAS 36709 3574 365 T9880 

SS200 S3s801 44 2414 SASTZ 404 BH252 4149 370 9500 

86142 5496445 37 2506 3342: doo AT25 373 0204 

84084 TH068 46 28 13 .4591 31094 1444 9 3238388 525 344 3855 

82026 9717247 20 3.9873 BOSSS 1761 BOSSL OST: 293 4111 

107 T9969 1827648 12 58 16544 29205 184 29424 H446 219 4562 

llr T7911 38937949 5 57 .BTTS 27845 S508 2797 TOZ21 — 0.00121 8712 


127 75853 60483 50 26510 74595 26510 74595 O 


The values of 4 and B are, “A = — 0.2719 4625, B= ~— 0.7080 1360. 


The numbers in the last column of the table are to be represented by a periodic 
function of the argument s, and we find 


0.00392 1901 sin s + 0.00008 6185 sin : 
0.00014 5846 sin 3s) + 0.00001 0693 sin 
0.00003 1124 sin 5s + 0.00000 3086 sin 
0.00001 0791 sin 7s 0.Q0000 1187 sin 
0.00000 4296 sin 9s + 0.00000 0443 sin 108 


0.00000 1185 sin 11s. 


The values of the common logarithms of the hyperbolic cosine of various 
multiples of c x 1° are given in the following table : 





eee 





oS - BS 8h: 


0.0106195 
0.0414879 
O.0899754 
0.1526127 
0.2258977 
0.3068064 
0.3929821 
0.4827065 


; 

9 
10 
11 
12 
14 
15 
16 
18 


O.ST4ATS1L5 
0.6683966 
0.7630127 
O.8582766 
1.0499092 
1.1460329 
1.2422634 
1.4349258 


The values of the latter factors of the 


c( 
Qe 
19 3 
38 4 


The scale along the contour of the map being 1.000 0000, the deviations 
of it for each degree of colatitude and longitude within the area of the map 


v - Vo) 


2’ 3”.44 
0 39 .54 


c(v — %) 
57° 27' 5”.44 
75 52 30.43 
93 57 58.82 


» terms of log m are easily derived 


2 


c( 


mw we nw Ww 


6277072 
.7241194 
.8205394 
.0133924 
. 1098226 
.3026865 
O991193 
Y1TLIRGBO 


U = Be ) 


111° 44' 32.24 


129 


146 


13° & 65 
24 42 .19 
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32 
33 
35 
56 
40) 
44 
45 
50 


~ = 
— 


Lt. 


— 





2.7848529 
2.8812866 
3.0741546 
3.1705884 
3.5563233 
5.9420586 
4.0384924 
4.5206616 


9.0028307 


c(v — %) 
163° 20' 3”.70 
180 


in units of the seventh decimal are given in the following table : 


~ 
~ 


40 
41 
42 
43 
44 
45 
46 
47 
48 
49 
50 





“a= Y° 
Q 
S20 
16882 
21685 
24485 
25377 
244355 
21625 
16758 
— 9669 


0 


u 





= ]° “w= 2° 

0) 0 
9676 — 212 
16618 15775 
21334 20226 
24088 22824 
24956 23646 
24041 22789 
21285 20200 
16505 15692 
9534 — 9097 


0 





0 


8307 — 
14155 
18112 
20418 
21152 
20402 
18114 
14119 

8240 — 

0 


w=4° uw=5° 
0 0 
H618 0 
11175 0 
14254 0 
16054 0 
16627 0 
16084 0 
14291 0 
11191 0 
6604 0 
0 0 
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The corresponding table for all values of uv, in the case of Species II of 
Lagrange’s projection, when the exponent is adopted so that the scale is equal 
along the northern and southern limits of the map, stands thus ; 


~ ~ 
y ~ ~ ~ 


10 0 $3° — 32283 16° — 36233 {i 
41 — 14032 44 36659 ‘7 31538 50 
42 — 24766 45 — 37956 48 — 23914 


The sum of all the deviations in the case of the map on Tchébychef’s prin- 
ciple is 0.075414: but, in the latter table the corresponding sum is 


0.1504656: thus nearly double the previous one. Ilence it may be judged 


how much the first map is to be preferred. 

The method of constructing the map on Techéebychet’s principle, here em- 
ploved, depends on drawing each meridian and parallel at intervals of 1°. In 
the first place it is necessary, trom the preceding table to derive the mean 
value of the scale in passing over a degree of longitude or colatitude. The 
rule adopted for finding this is to interpolate into the middle, and, differenc- 
ing the resulting table, add one-sixth the second difference. Between u = 4° 
and wu = 5° the change of differences is very rapid, and one must find means 
of estimating the mean independent of this rule. We have: 


VALUES OF THE MEAN SCALE ALONG DEGREES OF THE MERIDIANS 
z u— 0° - 2° w=3° us 4? 
40) 
41 
42 
13 
44 
45 


O.N994875 94949 9 7 «806849 96530 
S6H426 86632 87293 BRHT4A YON4Z 
SOOT SOS47 832 85717) 87168 
wO741 = 7 s S515 SOS9O) S475 
T4928 L, * 7663 THOS | BB477 
74044 249 THH4B TILOO &3549 
THS10 7 78356 SOKO9 B&4709 


8063- {y: SIS91 S738 87146 


17 
1X 
R657T6 ‘ie 87404 SS8H53B DOU4LG 


Y4994 95065 95284 95722 96563 





na 


VALUES 


x 





~ 
te 


41° 
42 
43 
44 
45 
46 
47 
48 


49 


Meridians 


69. 10934 


ONT11 
OR485 
O7251 
O6O1T 
O4784 
03551 
02320 
01093 


SD9OSTS 


0.9990227 


u=1° ux 
90524 
83206 83748 
78452 79147 
79647 76460 
74763 75622 


75698 
TNARS 
S5526 


= 
VO5T6 


Parallels 


55229 
16947 
e268 
26165 
13610 
ITY 
84045 
LOO984 
48370 
2.28180 


53.06389 


- 
— 


TOH502 
T9184 
85846 


GOT54 


Reduced to Chords on the Map 


0° 


69. 10934 
69,0971 1 
6908483 


69.0725] 


6017 


69.04784 
69.03551 
69.02320 
69.01093 
68.99873 


2° u 
91189 


84946 
80716 


80720 
S50O19 


J1L285 


According to the dimensions of the Clark 
along the meridians and parallels in miles are : 
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92504 
87195 
B5O98 
$1527 
80885 
81528 
85605 
87202 


YV2535 


spheroid the curved distances 


Oo 
10 
82 
50 
16 
83 


50 


. 





OF THE MEAN SCALE ALONG DEGREES OF THE PARALLELS 


u=— 4° 


95485 
92354 
90309 
89120 
88830 
89277 
N0282 
92407 


VOONT 


°° Ff 
32 31 
Od OS 
S1 680 
49 48 
lo 14 
82 81 
49 48 
1X 17 
q] a) 
‘1 70 


The reductions from the ares to the chords are very minute, 
33 units of the last decimal for the parallels and 5 units for the meridians. 


-- 


at most only 








32 


Desiring to construct the map on a scale of 100” to the inch, we have 
only to divide the numbers just given by 100 and multiply by the mean scale 
to get the distances between the projected points on the map. Thus is ob- 


tained the following table : 


DISTANCES ALONG 


2 u— 0° 
40° 
O.6907394 
4] ; ; 
H900332 
42 
6895036 
> 
HOS9ULLS4 
$4 ; i, 
OSSSTOS 
1 
OSNT4IN4 
46 . 
. HS87541 
47 ; : 
GBSRSO54 
48 
HRYLTS2Z9 
49 
; Hx96419 
450 


DISTANCES ALONG THE 


() 


40° Q. 


1] 
42 
43 
44 

15 
46 
47 


3) 
50 


w444 
O475 
5251 
Lt44 
SUS 4 
7765 
cash 
9160 
1965 


6467 


u 
115520] 
bo A2504 
1H20481 
4715942 
480] R57 
{SA7T1AS4 
1971901 
IODBODS 
9139752 
9225117 


5306356 


HILL 


THOG 
O930 
5932 
2270 
USS? 
Sod4 


S606 


2508 


6617 


- | u 


DjA?PO] 
1050 
29751 
16280 
O2249 
STHOS 
72502 
AHLOB 
{002 ] 
25314 


OH556 


THE MERIDIANS 


= 3° = 4° 


27 8533 


1X14 S451 


7233 9616 
BS 45 6703 
La 4 AGT 
£0352 424 


163 2902 
L094 ddd 
S248 4541 


H919 7500 


if 


DD?PO] 


$2041 


TH194 
aT190 
$0625 
2558S 


O6356 


PARALLELS 


) " t 
DH?O] 
455400 4ABDS 
31530 33722 
18385 21555 
O4648 O8R343 
9OIS4 94075 
T4807 TS8H5D 
OSH45 62029 
$1748 44428 
242400 285AT 


06356 06356 
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u= 5° 
0931 
Q708 
JS450 
.¢248 
6014 
ATI 
5548 
2317 
.1090 


Y870 
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We have now the sides of 50 quadrilaterals, which must be so disposed 
that they continuously cover the plane of projection, and moreover, must fit 
against one of the projected parallels. We take for the latter the one which 
corresponds to z= 50°. By subtracting the distances along the parallel of 
z = 49° from those of z = 50° we obtain the quantities in the first column of 
the following table, and, by comparing them with the distances along the 
meridians, we obtain the inclinations of two adjacent meridians at the begin- 
ning of their courses. By subjecting these to integration we have the 
inclination of each meridian to the middle meridian. Multiplying the sine 
and cosine of this by 0.5306356 we have the increments of the coordinates x 
and y which we denote by Av and Ay. Thus we have the following table : 


vw Differences Inclination Integrated Ax Ay 


1° 0.0085239 415 29°.46 0 0° 20' 44'.73 + 0.0032022 + 0.5306260 


2 83042 41 23 .52 1 2 11.22 V59S4 5305488 
5 82768 41°15 .21 1 43 30 .58 159749 9303951 
4 82110 40 55 .32 2 24 35 .85 223127 5301663 
5 80509 40 6.62 3 #5 6.82 285595 5298665 


Having now the coordinates of angles of the quadrilaterals which abutt 
against the parallel of 2 = 50°, all the quadrilaterals can be made to fill con- 
tinuously the plane of projection in determinate fashion. The rigorous 
following out of this procedure requires the determination of one of the 
diagonals of each quadrilateral, whence the position of the two unknown sides 
is derived. But the little curvature in the meridians enables us to greatly 
shorten the labor. Each meridian may be regarded as the envelop of all the 
circles deseribed from the known corners of the quadrilaterals just beneath 
it with the appropriate radii given in a preceding table. Thus it is easy to 
compute the inclination of the two sides of each quadrilateral which form 
parts of two consecutive meridians, and by addition to get the inclination on 
the middle meridian. Thence are determined the changes of the coordinates 
in passing along the meridian from one parallel to the following. The table 
shows these inclinations and the values of Ax and Ay: 





41'29''.46 
41 34.86 
41 45 .! 
41 59 .92 
42 16.86 
42 34 .6 
iS 33 .3: 
45 6. 
43.19. 
4326. 


+ .'6895963 —. 


HSO14600 


HSGoO]] 
HAY4TLO 
HSIING2T 


69068492 


+ .bSU2424 
HSSST27 
HSS6H55 1 
HSS85544 
HRRFHH2 
HRXHR2? 
6889014 
6892341 
HSDBRGHS 


HOOP2940) 


S5HG0OS 
Sh 1) } 
ROHUTO 


87206 


Ay 


— 2489955 
24ND] 
250721 
252931 
254170) 
256159 
2ATIDB 
259727 

261366 


202413 


In 


29 


Inc 


‘lination 


‘lination 


Ar 


2) 52".98 + 6894013 
GOS9VO387 
HSSTTINT 
BRSH561 
HSSH568 
GOSSTTOR 
HSOO],27 
HSUSTH2 
OS9ST33 


HIUOA3Z9T 


u=4 


Ar 


+ OSS055] 
HASHSAO 
HRSDIIG 
HSR4A7T05 
HRSS5ODO 
HSSH200 
HRRR1LOG 
GBS9OR97 
HSN4016 


HONG? | 


[October 


0166259 
166640 
167298 
168241 
169499 
170699 
171868 
178007 
174047 
174715 


Ay 


531046 
ZOD 


pepe 
edede 


336092 
33RR52 
341725 
344241 
346781 
349127 


SOOT17T 
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z | Inclination | Az Ay 
= = | - 
50° re 
3° 25' 10'.13 4+ .6887585) —.0411548 


49 | 

wa] 26 11 .05 6888681) 413655 
as | 27 34.99 = 6889739, 416533 
“| 29 11.98 6890770, 419847 
om, 31 9.23. 6891760 423840 
ad 33. 7.89 = 6892746 427881 
“ 34.51.29 6893762 431423 
os 36 33 ..69 6894776 434913 
41 38) 7 wl 6895803, 438120 
40 39 12 .55 GS9B886 440379 








By the addition of the values of Ar and Ay we arrive at the following 
values of the rectangular coordinates of the intersections of the meridians 








' and parallels : 
u = 0° ua 1° u = 2° 

) Lat. “ Y x 4Y = — y 
40° 0.000000 0 ~— 0.003202 0.530626 ~——-0.012801 1.061175 
41 0.689642 0 (6.692798 522303 0.702262 1.044549 
42 1.378825 0 1.381944 513967 1.391301 1.027885 
43 2.067720 0 2.070810 505597 2.080079 1.011155 
44 2.756474 0 2.759538 497183 2.768735 0.994331 
45 3.445223 0 3.448262 488712 3.457392 0.977381 
46 4.134093 0 4.137108 480180 4.146169 0.960314 
47 4.823211 0 4.826199 471589 4.835182 6.943127 
48 5.512715 0 5.515670 462944 5.524558 0.925827 

» 49 6.202748 0O 6.205663 454247 6.214431 0.908422 


445517 6.904971 0.890950 











50 6.893488 0 6.896352 





28775 
Jt LSOLS 
00801 
095544 
JT tO 
4726604 
161546 


LRO024S 


Logvloso 
1.566671 
1.541712 
1.516640 
AYIA 
4H5099 
40583 


.414588 


OS TOSS 
20045 
ALZST27 
2.117263 
SO5TA2 
3.494251 


182871 


022043 
USS 4355 
994550 


SYP20585 


JO7 0084S 
STH 106 
57274 


.146248 


HOTH05 
2? H1LO448 
2.509082 
2.527436 
2.485444 
2.445060 


2 100282 


NT1LbS2 aod .357130 


pad9482 1.388615 a.060772 So1276 DAY: 2.515629 


.220168 1.362479 6.250233 1.816363 


1.536237 6.940196 ] 


. is 
2.20982 4 


81291 , 98 | 2.225789 


which covering an 
area of about 700 miles by 500 miles, has a variation of scale of about a 


quarter of one per cent. The scale on the boundary is 300” to the inch. 


Such a map is interesting as being the best possible if we admit Tchébychef’s 
principle. 


4 ee l 0 i ’ ‘ 
tESEAU OF MERIDIANS AND PARALLELS CONSTRUCTED ON 
TCHEBYCHEF’S PRINCIPLE. 
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THE FOUNDATIONS OF TRIGONOMETRY 
By Artruur C. Lunn 


Introduction. In the existing literature of real analysis the purely 
logical introduction of the trigonometric functions, when done on the basis of 
theorems of analysis alone without appeal to any geometric setting to supply 
features of the proofs, seems to be accomplished essentially in two ways, each 
of which is interpretable in terms of either real or complex variables. 

According to the first method the functions sine and cosine, in terms of 
which the others are defined, are introduced by definition in terms of the series 

gy r y 


Sr) —_— 7 — a a. ao C'(«) —_ - 4 ; _ Ye oe 


» = - ° 
» a) ‘ b 


he 
_ 





Then the proof of the permanent convergence of these series shows the exist- 
ence of the functions for every value of the argument, and the theory of infinite 
series furnishes the basis of the proof that they possess the properties ascribed 
by elementary trigonometry to the sine and cosine of an angle whose radian 
measure is x. 

According to the second method these functions are distinguished as 
solution of the differential equations 


(2) S'(r) a C'(r), C"(2r) = — S(.), 
with the special conditions 
S(O) = 0, (0) = 1. 


These four conditions may then be viewed as a set of postulates, easily shown 
to be independent, involving as undefined elements the functions S and C, 
whose existence and unique determination are shown by means of the theory 
of differential equations. 

But these two methods are hardly as elementary as might be desired, 
implying as they do a developed theory, in the one case of infinite series, in 
the other of differential equations. Moreover, the connection with the intui- 
tional origin of the functions out of geometric constructions is so remote, that 
it is not easy to decide what properties of these functions must be proved in 
the critical theory in order to identify them completely as the analytic 
equivalents of the sine and cosine as understood in the practical trigonometry 
of lines and angles. The object here is to offer a supplementary point of view, 
admitting logical development of equal rigor, but based more directly on 
(37) 
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general theorems only of analysis, and more directly interpretable in terms of 
elementary trigonometric notions. 

Part I gives a set of postulates chosen suitably from familiar trigonometric 
propositions, with proof that they determine uniquely the functions sine and 
cosine and the constant 7. Part IT shows how the known expressions for 
sine, cosine, and angle, in terms of the tangent of the half-angle as auxiliary, 
may be regarded as definitions which lend themselves readily to a proof of the 
existence of functions satisfying these postulates. Part III contains a proof 
of the independence of the postulates, admitting the logical consistency of the 
preceding theory. Attention is confined to the case of real variables. 


I. Postulates and Proof of Uniqueness. It S(.) and C'(.r) be 
symbols of two undetined single-valued functions of the real variable «, and p 
the symbol of an undefined constant. With the admission, as part of the 
logical background, of such propositions of real analysis as will be specified, 
direetly or by implication, it will be shown that these functions and this con- 
stant are uniquely determined by the following postulates, which are thus 
sutticient, not only to serve as basis for the development of analytic trigonom- 
etry, but also in the sense that their justification by inference from diagrams 
would make the connection with geometric trigonometry complete. For 
brevity, P will be used to represent the set of all numbers of the form ip 2/, 
where /, / are integers. 


POSTULATES. 


S(rt yy = S(nyCl(y) + Clery Sly), fv, y are numbers of the 


(2) is continuous for every «. 
C'(.) is continuous for every x. 
U'( p E) = Ue 
S(p/2) = 1. 
('( LYY)= l. 
('(a) ts not negative for ee of the set Poouthe futerral 0. « « Pp -. 
If S(2) has a limit asv upprouches zero over the set of numbers 
pit", 2 herng "1 positive wuileger, that limit is vnity, 
The first: postulate is the addition formula for the sine, stated, however, 


only for angles which can be constructed from the angle p (straight angle) by 


successive bisections and additions: the seventh may be said to identify p/2 


a 


as the angular measure of a quadrant, whatever be the value of p; while the 
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eighth, suggested by the construction of the sequence of inscribed polygons of 
2" sides, is needed to determine the number pand thus fix the unit of measure 
of the angles. The first seven postulates alone determine the functions 
sin (7.0/p), cos (wx/p), leaving p arbitrary. 

Since the value of a continuous function at any point is determined by its 
values for the points of any set everywhere dense, it will be sufficient to prove 
| that the values of Sand C' are determined for every point of 2’, which is such 
aset. For the following proof of uniqueness then, unless otherwise stated, 
x, y will be understood to belong to 7. 

From postulates 1, 4, 5, we have 

(3) S(x + p/2) = C(x), S(p) = 9, 
so that 

C'(x + p/2) = S(e#4+p), 
which by the aid also of 6 becomes 

(4) C(x + p/2) =— S(x); 


hence, by substitution of x — p/2 for x, 


(5) S(x— p/2)=-—C(x), C(x—p2) = S(z). 
; These equations, with arguments on the two sides differing by p/2, give by 
f induction all of the familiar formulae needed for*+reduction to the first quadrant,” 


and also by substitution of « + y for x the addition formula for the cosine : 

(6) C(ety) = S(e#t yt p2) = C(e)C(y) — S(“)S(y). 
The values of S and C’ will therefore be determined everywhere by their values 
at the points of the set 7’), defined as the points of 7? which lie on the interval 
QO... p/2, or ‘first quadrant.” It may be noted that on account of the 
continuity assumed, all of the preceding formulae are immediately proved valid 
for any values of x and y. 

Now let 

o(x)= S(r) + C(x) ; iy 

then 


o(x+y)=a(x)a(y), o(2%) =a(") 


” 


and since o(p) = 1, the latter gives by induction o( p 2") = 1, where n is 
any positive integer, there being no ambiguity of sign in the square root 


because o is a sum of squares. Henee, by the addition formula for o just 
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given, and by continuity, o(.7) = 1 for every x; that is, 


S(x) +C(r) =1. 


On this account comes in a familiar way 


C(3z) = Cory. S(x) = 2 C(z) — - 
giving 
: » 3+ C(22z) 
(4) Cr) = 


» 


which, starting at # = p 2, determines ( suecessively for all points of the set 
p 2", the ambiguity of sign being removed by postulate 7. Then S is deter- 
mined for the same set by 
S(24) 

(8) WY = F0(r)’ 
where, as appears from the preceding determination of ('(.c), the denominator 
is not zero for any value of « concerned here. Then the addition formulae 
complete the determination of SS and C' tor all points of the set 7). 

To show that the constant p is determined, let 


—~ NY I; ). 


Then the recursion formulae sv, 
which Ly, = 2p: so that 


Here the numbers V7.1 J205.--. %, form a monotonic increasing 
sequence, since every ©, is positive and less than unity. But the sequence is 
bounded : for 


which gives successively 
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and therefore 


1 
M, < (1 i (1 1 (1 1 "Th = 1 
-5) -3) ~ gn — «se © | 


By a fundamental proposition due to Weierstrass this sequence has therefore 
a definite finite limit; hence the sequence of values of Z,, has a limit, which 








by postulate 8 is unity ; and hence p = 7, the definite number defined by 


7 jim Vu 
— =n 9 


ie A 


(9) 


where 


be| 





1 fod 1 
Ve VE Vb + ay 


i Ty . : ee 


H ——----- ’ 


V2 ve + avs V54 avd + dys 

The proof of the uniqueness of the system determined by postulates 1-8 
is thus complete. 

The particular set of postulates here used is of course arbitrary, and 
subject to endless possible modifications without loss of its desired character. 
One change perhaps desirable would be to state the addition theorem or func- 
tional equation only for acute angles, and to supplement it by such conditions 
as to make the values of the functions in the other quadrants determinate. 
Another method in the same spirit as the above would be to state a set of 
postulates involving a single undefined function such as to determine it as the 
tangent of the angle, and then to define the sine and cosine in terms of it, or 
preferably in terms of the tangent of the half-angle so as to avoid irrational 
expressions. 

II. Existence Proof. For the further development of the conse- 
quences of the above postulates the proof could run mainly along conven- 
tional lines, leading among other things to the equations (2) for the derivatives, 
and from these by Taylor's theorem to the series in (1). For the present 
purpose it may be noted that if ¢ be the tangent of the half-angle, defined by 


¢= S(2/2)/C (2/2), 








* Geometrically, VM, is one-fourth the perimeter of a regular polygon of 2"~?! sides, in- 
scribed in a circle of unit radius. 
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there would emerge the formulae 


2t 1-?# 
(10) S@)= pe CM= We 
while the angle x would be related to (by a differential equation integrable by 
quadrature : 
dt 


dx 2 


These equations suggest a simple means of proving the existence 
functions which satisfy the postulates of part I. For, let 


‘we dz 
(11) : I(r, y= | i 39 


- 2 


and let the number 7 be defined by 
(12) w= 2 1(0, wx), 


which is finite; then by the Archimedean property of the linear continuum, 
any value x of the fundamental independent variable (angle) can be written 
in the form 


(13) 2= E + 2rr7, 

where r is an integer, and & is a number, uniquely determined, satisfying the 
condition —7<& 7. Moreover, since the function under the integral sign 
is always positive, the integral /(0,/) is a monotonic continuous function of 
its upper limit ¢and ranges from — 7,2 to + wm 2 as (ranges from — x to + x. 
The equation 

(14) &= 2](0),1) 


for any &€ numerically less than 7 thus determines ¢ uniquely as a monotonic 
continuous function of &, and therefore of 4 for all values between any two 
successive odd multiples of 7. 

Equations (19) may then be used as indirect detinitions of functions S(x) 
and C(x), making these exist as single-valued continuous functions of x 


(postulates 2 and 3) for all values of x except those of the form (2r + 1). 
Continuity at these points and also satisfaction of postulate 6 are secured by 
interpreting the definition so as to imply also the limiting values S = 0 


, 
C=—1fort= +x: for these limits are independent of the sign of (as its 
numerical value approaches infinity, this sign depending on whether « is above 


or below the “straight-angle” value concerned. 
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Now the substitution z = 1/z’ transforms the integral Z(0, 1) into 
I(1, #), so that 7/2 = 21(0, 1) since = 2 }J(0,1) + Z(1, @){. This 
gives the related values: # = 7/2,¢= 1, and for these S= 1, C= 0, as in 
postulates 4 and 5. Moreover, as « ranges over the interval 0 - - - 7/2, ¢ranges 
over the interval 0 ..- 1, hence C is not negative there (postulate 7). 

If x is numerically less than 7, the mean-value theorem applied to the 
integral J(0, ¢) shows that x lies between 2¢ and 2¢/(1 + @), so that S(x)/x 
lies between 1/(1 +) and 1; as x approaches 0, ¢ must approach 0, 
therefore 

lim S(x) « 
x«=0 zr 
a stronger form of postulate 8. 

On account of continuity it is sufficient to verify the addition formula for 

angles x, y which are not integral multiples of a, so that they may be written 


(15) x = 21(0, t) + 2rz, y = 21(0, u) + 287, 
where ¢, u, are finite. Now the substitution 


iw of , z+t dz dz' 


_ oo "tan l1¢+2 142 





transforms the integrand function of 7(0, «) into itself: but with respect to 
the limits of integration a distinction must be made depending on whether the 
point of discontinuity z = 1/¢ occurs on the interval of integration; we have 
then 
I(t, v) when tu < 1, 
I(0, vu) =2 T(t, x) when tu = 1, 
Lae. 2x) + I(— x, v) when tu > 1, 

where 


t 
(16) ae 


~ fo oe 


and corresponding to these respectively 


x I(0, v) + 2(% 4+ 8) 7, 
(17) xe+y=< (2r+ 284+ 1) 7, 
F I(0, v) + 2(r+8+41) 7. 
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The intermediate case where éu = 1 may be treated by means of limits, while 
the other two show that if fu + 1, then v as detined is that value of the para- 
meter in the integral which corresponds to the sum of the angles. Hence 








* f+tu 
18 is - ~ Bh wos 2t(1 — wu") 4 Pu (l f*) 
-") me + a= ft+ur? (1 + @) (1 + wu?) 
a+ 7m) 
or 


S(x+y) = S(r) Cy) + C(x) Sly), 
as required by postulate 1. This completes the verification of the existence 
of functions satisfying the postulates. 

As an alternative method, the entire trigonometric theory could be based 
directly on the detinitions contained in equations (10), (15), (14), and would 
be equivalent to any theory containing postulates I-S as propositions, 
since they were shown to be sufficient for unique determination of the unde- 
fined elements. 


III. Proof of Independence. Let it now be granted that there exist 
in the realm of real analysis functions having the properties of the trigono- 
metric sine and cosine, and also, for brevity, that a certain amount of the 
theory of the exponential function, which occurs in two of the examples to be 





given, has been deduced in the usual manner: then it remains to be shown 
that the postulates of part I are independent. According to a recognized 
method this is done by the following series of examples, each of which gives 
a possible interpretation for the symbols S, C, p, such as to violate the 
postulate corresponding to the number annexed while fulfilling all of the 
others. Details of verification need not for the most part be set down. 

1 S(x)=2(1- dan zy; C(z)=1- se 


- p arbitrary. Here 
Pr P : 
G(x +3 q ' L (Mr) 8 wy _ aay ,. p-—? 
S(z+y) and S(xz)C(y) + C(x) S(y) differ by 7 12— - (x+y), 
so that postulate 1 is violated; all the other postulates are satisfied. 
2. S(x) = sin « for points of the set 7, but having arbitrary values 
differing from those of sin # for other values of x; C(x) = cos x for every x; 
p=r7. Note here that all the postulates aside from those of continuity refer : 
only to the points of the set P. 
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3. Similar to 2, except that it is C(x) which takes arbitrary values, 
ditfering from those of cos x, for values of « outside of the set P. 


sin 5a . - 
—,— 3; O(«4) =a* cos 5x2; p=the smallest positive 
od 


4. S(x) =a’ 
number satisfying the equation sin (5p/2) = 5/7 (approximately p = 0.318) ;: 
anda=7*. Here C(p/2) = V24. 
sin 2x 
ae C(x) =cos 2x; p= 


The postulate concerned may be said to fix the vertical scale of the sine curve. 


5. S&(x)= Here S(p/2) = 1/2. 


ro! 9 


sin 2w T ee 
> 3 C(x) = a* cos 2x3; p= at and a= 2’. This 


6 62) =e 


makes C(p) =—4. 
i. S(v) =sine: C(e) =cose: por. This brings positive and 
negative quarter-waves into the interval 0... p 2 for the curve of C(2). 


’ ° ’ Tv 
8. S(x) =sin me; C(r) =cos mr; p=—; where m 4 1. Here 
Mit 





m S(x m . 7 
lim (7) _ m. Forexample, ™ might be j 


. 7 80 that « would be measured in 
721) x ¢c 


degrees. 


CHICAGO, ILLINOIS, 
FeBruary, 1908. 





SUFFICIENT CONDITIONS FOR IMAGINARY ROOTS OF 
ALGEBRAIC EQUATIONS 


By Otrtro DUNKEL 


Introduction. The study of algebraic equations has led to the develop- 
ment of a considerable number of conditions, necessary or sufficient or both, 
for the existence of imaginary roots of such equations. The theorems of 
Descartes, Budan-Fourier, Sturm, Newton-Sylvester,* which afford such con- 
ditions, are well known. 

More recently Professor E. B. Van Vleck t has given a sutticient condi- 
tion for the existence of the maximum number of imaginary roots; and 
Professor Kellogg t has given a necessary condition for the reality of all the 
roots, which is allied to a special case of a more general theorem which it is 
the purpose of this paper to develop. It will be shown that several well 
known conditions can be easily derived from this general theorem, which we 
shall refer to as the Fundamental Theorem ; and in some instances sharper tests 
than those usually given can be obtained. One of the special cases of this 
theorem will be shown to be part of Newton's theorem on algebraic equations. 
The theorem itself permits not only of a generalization of some tests already 
known but also of the derivation of new tests. 


Fundamental Theorem. The general equation of the n™ degree : 
(1) fi vr) = por" . px" +. p,2*-* tere + Pn 42 + Pn — () 


will be considered ; but for simplicity in the statement and also in the proof of 
the theorem it will be convenient to write the equation in the form : 
1’ yn 4 ~n—l n!} pn—i 
(l') aye" + nae tee e + cmos yi +++++na,_\4 +a, = 90, 
(m—a)tel 





*Cf. E. Netto, Vorlesungen iiber Algebra, vol. 1, for the statement of these theorems: 
Descartes’ Theoreyu, p. 219; Budan-Fourier’s Theorem, p. 216; Sturm’s Theorem, p. 238; 
Sylvester’s Theorem, p. 225; Newton’s Theorem, p. 233. 

7 Van Vleck, ANNALS OF MATHEMATICS, ser. 2, vol. 4 (1903), p. 191. 

> 0. D. Kellogg, ANNALS OF MaTHeMaTICs, ser. 2, vol. 9 (1908), p. 97. 


(46) 
































(2) 





IMAGINARY ROOTS OF ALGEBRAIC EQUATIONS 


where 


— iti! 
inc soa ium @, 1,8, -- +0. 


n! 

The series of numbers (9, 4), 4g, - + - @, computed from the coefficients of 
(1) by means of the relation in (1') will be referred to as the binomial coeffi- 
cients of the equation (1). 

The Fundamental Theorem which we shall now prove may be stated as 
follows : 

If Ay, Ay, Aa, ++ + Ay are the binomial coefficients of an equation of the nt" 
degree; and, if any equation of lower degree whose binomial coefficients are a 
consecutive set of the given binomial coefficients, e. q.: 


/! 


t t—1 “ t— . — 
Oy + im 41% tr ee ty! ping a es fn 4 ¢—1* + An+¢=%, 


(¢- 


has imaginary roots; then the original equation has at least as many imaginary 
roots. 

The proof of this theorem follows very easily from two theorems which 
we shall state as lemmas. 

First Lemma. If any derived equation of the given equation (1), i. e., 

[‘(x)= =A 2 @, 

yl ss dx 
has imaginary roots, the original equation must have at least as many. The 
truth of this is very easily seen froma figure of the curve y = f(x) by the 
application of Rolle’s Theorem.* 

Second Lemma. If any equation has imaginary roots, its reciprocal 
equation has just as many ; and the converse is also true. Since the reciprocal 
equation is obtained from the original equation by the substitution» = 1/y, 
and its roots are consequently the reciprocals of the roots of the original 
equation, the truth of this second lemma is seen at once. 

The (n — m — ¢)™ derived equation of (1') can be written down imme- 
diately, since the result of the differentiation will be of the (a + 0) degree 


* Netto, Algebra, vol. 1, p. 208. The theorem is not stated explicitly by Netto in the 


above form, but follows from his Satz u. The analytical proof is easy. 
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with corresponding binomial coetticients, after dividing out a factor common 
to each term of the result ;* i. e., 


(3) @ox™r! + (m a t) wer +--+ (m+ t) Am+1—1% + Any e = 0. 
The reciprocal equation of (5) is 


(4) Omary™t + (m+ t) @..4¢—19"" t—1 +eoe + (Mm + thay + a, = VU. 


The m** derived equation of (4) is of degree ¢; and, if we neglect again 
a certain factor, it can be written : 


(5) Oe yt 4 ta, 4 r—1y' —1 fieee t tn Ly - t= ,. 


Finally the reciprocal equation of this last (9) Is 


t 


(0) ln! + 4,42" * toe ee tu ae 1a if. On +t —_ 0. 


Suppose now (6) has 24 imaginary roots ; then by the second lemma, (5) 
has 24; and in turn by the first lemma, (4) has at least 24; then (3) has at 
least 24°; and finally by the same reasoning the original equation (1') has at 
least 24 imaginary roots and hence our fundamental theorem is proved. 


Specializations of the Fundamental Theorem. In order to ob- 
tain special sufficient conditions for the existence of imaginary roots of (1) or 
(1'), we have merely to write down any selected number of consecutive 
binomial coetticients, form the corresponding equation (with binomial faetors) , 
and impose the condition that it have imaginary roots. We shall thus obtain 
a set of relations such that if any one is satisfied the original equation has 
imaginary roots. 

The simplest set of conditions of this kind is that in which three con- 
secutive coefficients are chosen and quadratic equations are formed. For an 
equation of the n degree there are (n — 1) such quadraties of the type: 


(7) a,_ 47 + 24.7 + a4, = 9, $= 1, 2.3 +++ p= j. 
If now for any one of these equations 


(8) a; A; 10,41 ° E 


equation (7), and consequently (1'), will have at least one pair of imaginary 


* Burnside and Panton, Theory of Equations, vol. 1, p. 6%. The differentiation of the 
general term of (1') with a subsequent slight change inthe form of the numerical factor shows 
this at once. 
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roots. If we return to the notation of (1) by means of the relation between 
the a’s and p’s as given in (1’), we shall have the result : 
If any one of the following inequalities is satisfied : 
. £€443 an € s I 
3) g-——— 


r n— st 





Pi-1Pi4i < %, i=l, 2, 3, . -n—Il1, 


the equation (1) has at least one pair of imaginary roots.* 

This result may also be stated in the following form : 

If the equation (1) has all its roots real, then the following n — 1 ine- 
qualities must hold : 
stin—s+} 


? GB < § 


' 2 . 
(8”) os loa Pi-1Pi4i1 = %, t= Ii, 2, 3, -++n—l1, 
The condition given by Professor Kellogg t corresponding to (8') would 

read: if for any value of ¢ 
Pi — Pi-1Pi41 =, ~=1, 3s 3,°°--n—l, 


then there are imaginary roots. In the cases in which either test applies, i. e., 
when p,;_, and p,;4, have the same sign, it is seen that (8’) gives a some- 
what sharper test. 

This method of obtaining sufficient conditions for imaginary roots can be 
extended. We could for example make use of a series of cubics : 


(%) a,x =o ba; 4 1x" TC 3a; +a a Ajig = Q, 


and, by imposing the condition that these cubics have imaginary roots, we 
would obtain a series of n — 2 inequalities such that, if any one is satisfied, 
the original equation has imaginary roots. 


Relation to the Newton-Sylvester Theorem. The condition (8’) 
is a special case of the theorem of Newton previously referred to. Newton's 
theorem may be stated as follows : 

From the coefficients of the equation (1) the following sequence of 
(rn + 1) numbers is formed : 


(10) po, Pi—h PoP» P2—l2PiPs +++ Pi —UPi-lPituy v0 + Pus 
* The case in which three consecutive coefficients are zero will be considered later; 


and it will be shown that in this case there are imaginary roots. 
+ Kellogg, loc. cit. 
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where 
4 CHF O— EST. 
oT sar 


then the number of real roots of (1) is not greater than the number of per- 
manences of sign in the sequence (10), the difference being even; and 
therefore the number of imaginary roots is not less than the number of varia- 
tions of sign in the same sequence. In particular, if any one member of the 
sequence is negative, the equation has imaginary roots; and this is precisely 
the result (8’). If all the roots of the equation are real, then the members 
of (10) must be either positive or zero. 

This theorem was given by Newton* without proof and remained without 
proof until Sylvestert proved a more general theorem, from which Newton’s 
theorem was obtained as a specialization, just as Descartes’ Rule of Signs can 
be obtained as a specialization of the theorem of Budan-Fourier. 


Derivation of Other Special Tests. A well known theorem in 
regard to the roots of an equation with one or more zero coefficients can be 
easily derived from the Fundamental Theorem. 

Suppose in the equation (1) 


(11) Pm+i = Pm+2 = ae = Dos; = 0, 


while p,, = Oand p,, 474, 4 %: then the equation corresponding to (2) takes 
the form : 


atti 4a 


An v T m+t+] 


0, 


If ¢ is even, this equation has ¢ imaginary roots; if ¢ is odd, it has ¢+ 1 or 
t — 1 according as a,, and a,,,,,, have like or unlike signs, or according as 
Pm and Py 4++1 have like or unlike signs. 

Hence, by the Fundamental Theorem, ¢(f any equation have t successive 
coefficients zero, there are at least t imaginary roots if ¢ is an even number; 
if t is odd, there are at leastt + lor t—1 according as the coefficients just 
before and after those vanishing have like or unlike siqns.t 

*I. Newton, Arithmetica universalis, Cambridge, 1707; second edition, London, 1722, 
chap. I. 


tJ. J. Sylvester, Phil. Mag. ser. 4, vol. 31 (1866), p. 214. For a proof of this theorem 
see E. Netto, loc. cit., vol. 1, pp. 225-235. 


;Cf. Burnside and Panton, Theory of Equations, vol. 1, p. 197, where this result is 
deduced from the Budan-Fourier Theorem; also Netto, Algebra, vol. 1, p. 221. 
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This result can now be applied in turn to the original equation (1) or to 
any one of the equations (2) derived from (1) in order to obtain other forms 
of sufficient conditions for imaginary roots. As an example of this method, 
let us multiply the left-hand side of (1) by the left-hand side of 


(12) e+ cn"—!4 car’? 4.---40¢,_ 2+ ¢,=0. 


We shall thus obtain an equation of degree (x + 7), which has the same 
number of imaginary roots as (1) provided that all the roots of (12) are real. 
Hence if we can choose a set of ¢’s so that all the roots of (12) are real and 
also so that two successive coefficients of the resulting equation are zero, i. e., 
so that for a special value of /, 
(13) CePi t Cr—-1Pi4n Fee + VPi4er—1 + Pig = % 
| Cy Piga + Cri Pig¢e tee + Pitre + Pigrgr = 93 
then the new equation, by the theorem above, has imaginary roots; and con- 
sequently the equation (1) must have imaginary roots. 

For r = 1, all of these conditions are satistied if 


Pi Pi+i 
Pi+1 Pi+2 


—(: 


hence: 
If three consecutive coefficients of cn equation are in geometric progression, 
the equation has imaginary roots." 
For r = 2, (12) and (13) lead to the result of Professor Kellogg t that if 
the quadratic 
| x? aL 1 
(14) | Piss Pi+i Pi 
Pi+s Pi42 Pi+i 


Il 
S 


has real roots, the equation (1) has imaginary roots. 
The special case in which (12) becomes (« — 1)*, and (13) becomes 
Pi — 2Pi4 + Pi+2 = %; 
oP 2Piae+ Pi43 =% 
* This also follows directly from (8’), which is satisfied whenever the three coefficients 


are in geometrical progression. 
tKellogg, loc. cit. 
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oa 


vives the theorem of Hermite * ; // four successive coefficients of an equation are 
in arithmetic progression, the equation has imaginary roots. 

A result analogous to (14) may be obtained in a different way. Con- 
sider any one of the equations (2) and replace « by y + % and apply to the 
result the condition (8); for simplicity, consider the cubie (9) which becomes 


after the substitution : 
(15) ayy + 3b,(h) 7 + 36,(h) y + o3(h) = 9, 


where 


: i(j — 1) 
p(k) = ah! +ja;4, —' , E »! Opp bh 


+ 
— 


If we ean find a real value of / so that 


,(h) o(h) 
$;(h) d3(h) 


then by (8) the equation (15) has imaginary roots and therefore equation 
(1') has also imaginary roots. After a simple reduction this inequality 


becomes: 


(16) A;.,h + 19 ash + O44 


Qj+4 h . Aj42 


This inequality can be satisfied if the determinant on the left has real and 


distinet roots, i. e., if 
O54 2 a; . Ul; +2 a; +] | a; +1 a; 


(17) |—4 > 0. ¢ 


Qi4g 441 | G43 Apis G42 Aya 


Tf then (17) ts true, the original equation has imaginary roots. 
. / ‘ iv) 





*E. Netto, loc. cit., p. 225. This theorem also follows from (14). 
+ See Burnside and Panton, Theory of Equations, vol. 1, p. 68. 
t This reduces to 


] , ' , 
a? [ (4, .,a? — 3aa, _ ,a, 9 +- 2a; i+ 4(aa,.,—a7,,) | 0, 


in which the square brackets enclose the discriminant of the cubic (9). This then is the 
best test of this kind which it is possible to obtain. 














ey ia, 
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Combination of Van Vleck’s Method with the Fundamental 
Theorem. A method similar to that employed by Professor Van Vleck * 
can be used to derive from the Fundamental Theorem another form of suffi- 


cient condition for imaginary roots. 
Given an algebraic equation of degree k, written in the form (1); then 
if / is even and greater than two, and if 


(18) qi = 240 qa < 0, Ti-1 _ 24: —2 dk < 0, Vi — Yi-1 Vi41 < 0, 
= 3,5,7,---k—3, 


all its roots are imaginary.t For from (18) it is clear that none of the coefti- 
cients with even subscripts can be zero and also that they must have the same 
sign. 

Writing the equation in the following way : 


q2 os 


(19) ak—* (u2* +t eit; (ae* + 2qge + 1) 





awk —6 


+ > 4X" + 25x + 0) t+ere t+ 5) (Ge — + 247 — 3% - 1-2) 


= (t= eal VE an ye a 1) = 0, 


it is seen that the expression can never vanish for real values of x; since from 
(18) the quadratic expressions in the brackets cannot vanish for real values of 
x and for such values have the same sign (that of the coeflicients of even 
subscripts ). 

Let us consider now one of the equations (2) derived from the original 
equation written with binomial coefficients, supposing ¢ to be even. If we 
apply the result above to this equation, we shall obtain the following result in 
regard to the roots of (1') : 

If any consecutive set of coefficients of the equation (1') satisfy the fol- 
lowing inequalities : 


* Loc. cit. 
tFor k = 2, replace condition (18) by gj — 4q,q. < 0. 
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t—1 
it ie a ded nlm +2 < 0, 
i t—i 
(20) aes, aS ae Sea lm +i-1¢m+i41 < 0, 
i+1t—i+1 
‘ t—1 i - -« + , 
ee i eee, ag a Om +t4m+t—2 < 0, 1=3,9,7,++--t—3, 


then the equation (1') has at least ¢ imaginary roots.* 

This would not afford a convenient test, owing to the dependence of the 
numerical coefticients upon the number of inequalities in (20). This difficulty 
‘an be removed by a slight sacritice of sharpness in the test; for in no case 
will any of these numerical coetticients be so small as 4.f We have then the 
following result : 

If any consecutive set of coefficients of the equation (1') satisfy the fol- 
lowing inequalities: 


(21) Oni $ On4i-1 Ansig¢i <9, t= 1,3, 5,---¢—1, 


the equation (1') has at least t imaginary roots. 


UNIVERSITY OF MISSOURI, 
Co_tumBia, Mo. 


*For ¢ = 2, replace (20) by a@,., — 4,,4,,.. < 0. 


+ In fact they are greater than or equal to 9/16 and this fraction could be used in the 
inequality (21) instead of the 4. 


RPE RRR Rn mre iinet 








CONCERNING A COMPOUND DISCONTINUOUS SOLUTION IN THE 
PROBLEM OF THE SURFACE OF REVOLUTION OF 
MINIMUM AREA 


By Mary E. Sincvair 


Introduction. Among the minimum surfaces which can be represented 
by liquid films is a surface of revolution across which is a transverse plane 
circular film.* It can be constructed by placing one on the other two equal 
rings, dipping them into soap solution, and then drawing them apart always 
parallel to each other and perpendicular to an axis through their centers. A 
longitudinal section of the surface shows on each side of this axis a system of 
three curves meeting in a point, two of the curves extending respectively to 
the two rings, and the third extending to the axis. The experiment suggests 
the following mathematical problem : 

(riven two points Ag and A, in the xy-plane and on the same side of the 

v-axts, Soin the points by a curve confined to the pos- 

A. — ttive side of the x-axis, and from an arbitrary point 
P, of this curve draw a curve to the x-axis. To find 
among all such systems of curves that one which when 
revolved about the x-axis shall determine a surface of 

x revolution of minimum area, All curves which we con- 


M sider will be of class D'.t 
Fic. 1. 





The object of the present paper is to give a solution 
of this problem. The discussion is divided into three parts 4, B, C, dealing, 
respectively, with the necessary conditions for the solution, the sufficient con- 
ditions, and the character of the field of extremals. The principal results ob- 
tained are as follows : 

1. The solution must consist of two catenaries 21) A,, A, A), with the 
a-axis for directrix, and the normal A, .V from A, to the x-axis. The cate- 
naries must meet the normal .4, .V under an angle of 120°. (A, §§ 1-3.) 

2. Onthe are A, A; there exists a critical point Aj, a limit on the left 





* See Platean, Statiques des liqnides, for a general discussion of minimum surfaces which 
can be represented by liquid films 

+ Bolza, Lectures on the Calculus of Variations, §2. A curve is of class D! if it is con- 
tinuous and consists of a finite number of arcs each of which has a continuously turning 


tangent. 


(55) 
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for the position of .f,: on the are .1, 4, there exists a conjugate point 21), a 
limit on the right for the position of wf. For the latter point there exists a 
simple goometric construction which we shall call the Lindelof construction for 
our problem.  (.b. $$ 4-H.) 
3. The above are not only necessary but also sufficient conditions fora 
relative minimum (77). 

tL. The solution is unique within a field defined by the one-parameter 
set of extremal-svstems passing through .f,, defined in Labove. CO. $$ 1-3.) 

>. ‘The field consists of two parts in which the area of the surface of 
revolution given by our solution is respectively greater or smaller than that 
given by the solution found by Goldschmidt.” (@, $4.) 

6. Experiment verities the results defining the position of the conjugate 


point.7 (CO, S$.) 
A. Necessary CONDITIONS FOR THE SOLUTION, 


1. The system consisting of segments of two catenaries and 
one straight line. We -uppose a totality 2 of svstems of three curves 
of the form « = r(/), ¥ = y(/), which, as indicated above, 

1) are of class 1), 

2) lie in the region 7 — 0, 

5) meetina point 7’, joining It respectively to the given points Ay 
and Aj, and the x-axis. 

We suppose further that a particular sv-tem through a point u1,, of which 
we shall denote the segment wf, .1, by &,, the segment A, al, by &,, and the 
segment joining Jf, to the w-axis by &, actually furnishes a relative minimum 
for the integral 


/ -| 2a . y” ts: | 2ary\ yr” y” dt A. | Jory” + ¥ : dt, 









where «and 7 are the derivatives dz df and dy dt, 


* Goldschmidt, Prize Exsay, 1830, 

+ After the completion of this paper, two papers by H. Tallqvist dealing with the same 
problem came io my attention: ‘Determination experimentale de la limite de stabilité de quel- 
ques surfaces minima,” Transactions of the Scientitic Society of Finland, vol. 32 (1889): and 
Bestimmung ciniger Minimealtachen deren Begrenzung gegeben ist, Nelsingfors, 1890 Tallqvisi'’s 






results concerning the limit of stability, obtuined by an entirely different method, agree with our 
own results in the special case of equal rings; they are wrong in the case of unequal rings, as 
will be shown in ©, §5. which I have added for the sake of comparison. 
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We vary each of the segments G,, G, ©, separately, leaving the remain- 
der of the system fixed. By the usual theory, each of these curves must in 
the first place be a solution of the Euler equation 


Fry — Pay, t+ Fy (a’y" —2"y') = 9, 
where F = 2ry Va 4 y", 
and therefore for our problem must be either a catenary,* 
x=at+ B, y=acht, 


or a vertical straight line, 
x= 4a, y=. 
In particular &, and &, must be catenaries, and &, must be a straight line, 


since no catenary of the set given above touches the «-axis. Tle points 4; 
have coordinates («;, 4;), and we write 


° = agl + 3, ’ Ay = Aol, + Bos 2 = Aly ’ 
Y: Po where |= 4 Ro and {tz = aot: + By 
y = agelif, (4) = ayeht,, (4, = agchty: 
(1) — ; 
7 x=a,t + B,, eo = @T2 3), t, = a,7T, + 8, 
Q): | “! where % - ae 4 “and 4 e a 
Y= a,chr, ( dy = Qy ch To, (OQ = ay ch 713 


UW, : SS = (ty. 

Further, ©, ©, ©, must each satisfy the conditions numbered by Bolza 
I, 11, U1, and 1V,¢ which are necessary for a minimum. The stronger forms 
Il’ and IV’ are always satisfied} since along the system of curves under con- 
sideration 

; ‘ Py 
By(ey cosy siny) =- s-=Y> 0, 
: y'2 : 

We may also immediately require that condition TIT be satisfied in the, 
stronger form III’, by the ares &) and &, since the envelope of the one- 
parameter set of catenaries through 4, has no singular point, and therefore 

* Bolza, loc. cit., p. 153. 

+ Loc. cit., §25-28. 
> Bolza, loc. cit.. p. 146. 
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the are © from «A, to its conjugate .4,' could never furnish a minimum.* 
A similar remark holds for &,. The two catenaries and the normal to the 
x-axis, which form the system, must therefore separately satisfy all the conditions 
Sor fixed end points. 


2. The integral as a function of the coordinates of ?:, the 
corner point. We shall apply the following theorem :t 

Let My(¢= ty. 2 = to. y = by) and M(t = 4, 2 = 4, y = 4) be two non. 
conjugate points on any extremal. Then it is always possible to construct 
through the points 2y)(2 = 1. y¥ = yo) and P\(x = 1, y = y,) in the neighbor- 
hood of Wy and J, respectively, a unique extremal, over which the integral 
is a single-valued, continuous function of the coordinates of Py and P, 


J (0. Yoo Tis Yi)s 


with continuous partial derivatives in the vicinity of (a), 4), a, 6,). Further, 
the total differential of J is ¢ 


AI (Ly. Yor Br i) = Felt ty tie Wi) dey t+ Py (ay he 2 yan 
— Fy (4o5 Yor Tos Yo) ty — Fy: (205 Yor Xr YoYo 
For our problem, therefore, 


(2) d J( Los Yor “15 “) 


Wy WY\ Yoh Yofi 
7 2! ao + SSS ln - 2 = ley — >! 0 
| Vi 4 Vey + Hy Vuly + Yo Vip + WJ 


For the extremal-system, ©, G, ©, the integral J is completely deter- 
mined in terms of this function J, its value being 


(3) I(&, &,, &,) = ST (My, bys Os, hs) + S(t, bs, yy b,) + 7 b3. 


Suppose P;(x2, 2) any point in the vicinity of A,. Then if the catenary 
through A, and P, be &, and that through /?, and A, be &, we have 





* Bolza, loc. cit., p. 204. 
+ Bolza, loc. cit., p. 174, footnote 1. 
$ Bolza, loc. cit., p. 176. 
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& . t= dot + By, 
ail y = agcht; 
(4) is | = a,T + Bi, 
is y = ayehr; 
G, : t= Fay 


with equations similar to those given under (1) at the end points. Over this 
system, the integral J is a function of 7,, 4,, namely : 


(5) L(Goy Gry Ex) = PC Hay Y2) = A (os bas Hay fx) + Ay Yor Ge by) + TY 


It is now necessary that the function @(~,, y,) have a minimum for the values 
Hy = Ug, Yo = 43. By the ordinary theory of maxima and minima, the follow- 
ing conditions must be satisfied for 7, = a, y, = b,: 


(6) cp = 0, op = 0, hs A S 0, (<<) - (<5 ) (; dl 
CH, CH2 CLAOL CLIC WY CHO Ly CH2CY2 


IIA 


3. The corner condition. We first discuss the conditions on the 
first derivatives of @ with respect to x, and y,, where x, = ¢, and y, = 4, 
From (2) and (5), we have 


ch Yo" , Yrs 


— = 25 >; 
Cx \ 2 72 
~~ tat hz 


oh Yaa Yor 
-¥ -7 V2 . \ ~y —_ 
Cy2 + 2 ty + 2 


where x}, y; refer to & and Z, 7 refer to G. From (4), we see that for 
%y = a, and y, = by, 


’ ’ - V 2 2 
y=, Ya = ag sh, — Ths = Te 
and 


-;2 


x; = Qa), ys = a; sh T 2,5 Ve + Uy = Ya 


Hence at the point (a, 4.) we must have 


| » 


= 277 (ao _ a,) = 0, 


(7) 


ta) 


ay 
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and hence 


(8) @y = Q). 
Moreover, (64) requires 
(9) 7 = 27a,(sht, — sht, + cht) = 0. 
Oe 
But b, = agcht, = a, chr, 


and therefore 


Using the upper sign, (9) becomes impossible, since ay + 0 and ch ¢, 4 0. 
Hence 

(10) t, = — To, 

and the equations 


(ly = Ayl, + Bo = a7, + Py 
give 


(11) a, = 4(8, + 8). 
Condition (9%) now requires 
2+ cotht, = 0; 


from which, if we represent by p, the slope of ©, at A, and by w, that of G, 
at A., we have ; 


l 
t, = — log y3, Pp, = shé, = — nF . 
(12) | 
R= log V3, wT, = sht, = + T= 6 
Vo 


We obtain from equations (8) and (12), therefore, the result that the vertices 
of & and &, must he at the same distance Srom the a-asis, and that the three 
curves Sy, Ej, E, must all meet at an angle of 120°.* 


4. Critical points. We shall now discuss the remaining conditions 
of (6). 


* Physically, a result of equal tension; Tallqvist, Determination experimentale ..., loc. 
cit., p.7. Compare also Platean, loc. cit., vol. 1, §173, p. 294, where the same result appears 
for a system of two equal spherical surfaces and their plane of intersection. 
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We obtain partial derivatives of do, d,, t, 7, with respect to x, and y 
from equations for the end points of ©, ©, similar to those given under (1). 
Kvidently 
Sg —-%= Gy (1, — é), 
(13) h, == ay ch/,, 
Yo = aych i. 


Differentiating (15) with respect to x,, and putting 


2 = Az, Yo = bs, We 
obtain 
Cay of, ct, 
] —= « (fy —_ fy) . ay (<= _— 2 , 
Cry Cn, Cr, 
a c 
Q=;5 cht, + Ay shf, — ’ 
Cry Crs 
Cay Ch, 
QO= .— cht, + aygsht, — 
( Wy c nap) 


whence, 
(do 1 


Gry ¢; — t, — cotht, + cotht, 

We shall use the abbreviation y(t) = coth ¢ — ¢. 
Then 

Cay 1 
(14) — = ——_—__—__ 

Cry x(t) — x(t2) 

Differentiating (13) with respect to ¥,, and putting 72 = a), yy = 4:, we 

obtain 


Ca Cty ct 
v= > (t, _ fo) +> a, (— eas *), 


ar Cy Oy 
Cao Cl 
0 = .— cht, + apsht, —, 
C2 Cy 
Cay Clo 
l= Py ch, a a,shty —— 3 
Cs CH2 
whence 
C 1 @a 
(15) a Te Se. PB 
X(fo) — X(4) shh, shty 6X2 





foro we Se 
os kee 
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Similarly, we obtain 








Cay l 
ry X(T) — X(T)” 
(16) 
Cay 1 Cay 
ty + + +sht;, ¢ 
Further, 
ct, 1 F cht, Cay 
ae re 
(17) 
CT; : cht, Ca, 
= — 1+——- |}. 
Cy, = =ayshab sh 7, =] 





Making use of (15). (16), (17), we now obtain the second derivatives at 
the point (,, 4.) 


Oe ee 
PyCY. » ¢ TOF 


~~. = 2m (1 + cotht, — coth t + in ote l an) 


CHaC 42 sh ty CL, sh T2 Cry 





Condition (Hc) therefore requires 


ay (= C ) wh. ] . 1 > oO 
”? er, Or) X(t) — XL) X(T) — XC) 


But x(¢) is a decreasing function, continuous except at¢= 0. Hence, since 
ty<t,< 0 and 0 <7, <7,, each fraction is positive, and condition (6c) is 
alivays fulfilled in the stronger form 


= 


o- J 2, 
a ,o. 


~ - 


Cry CFs | 
Since — sh 4, = sh 7,= 1 V3, condition (62) requires that for 7, = a2, yz = by, 


Ca Ca Ca Ca Ca Ca 
a(: +, cay - (4 - =) —3+3(5 - : : 
Cry Cr. Cry 


=), 


All 
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Ol 
} Cay Ca, Cay Ca, 
OL, OG 


Wg 
If we apply (15) and (16), this becomes 


log 3 + x(%) — X (to) 


siiagetaeee ! = 0. 
1X(71) — X( 72) (x(t) — x(4) | 





Since the denominator is always negative, this condition is equivalent to 
the following : 


IV 


(20) log 3 + x(71) —x(%) 29, 


from which two results may be derived. 

Let fy have any fixed value, and let 7, increase from 7, on. Then 
since y(‘) is a decreasing function, log 5 + x(7,;) — x(%) decreases from 
log 3 + x(7:) — x(4,). lence, if the latter value is negative, it is impossible 
to satisfy (20). But 

X(T) = 2 — log y3, 
and hence 
log 3 + x(T) —x(%) = 24+ log ¥3 — x(t). 

It is therefore necessary that 
(21) x(%) = 2 4+ log y3. 

If we denote hy t, the negative value of t satisfying the equation, 

(22) x(4) = 24 log 3, 
it ts then necessary that 
(23) ty =f. 

The above condition (22) being satisfied, condition (20) further requires 
that 7, satisfy the condition 
(24) x(7)) = — log 3 + x(%). 

Tf we denote hy t, the positive value of ¢ satisfying the equation 
(25) x(1) =—_—— log 3 + X(t). 
it is then necessary that 


(26) 
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For the extreme value ¢, = 4, ¢) must be equal to 7,. For the extreme value 
fy = ts, f, takes the value @ which is the positive value of 7 satisfving the 
equation 

X(T) X (fs) log o= 2 — log \s. 


As ¢, increases from /, to 4, 4) increases from 7, to 7}. 


5. Geometric construction for critical point. For these con- 
ditions restricting the position of 4, and A, we have the following simple 
geometric construction : 





Let UT, be the x-intercept of the tangent to © at Ay. Then 
OT, = 8) + ale — a coth &. 
Let OT be the x-intercept of the tangent to ©, at A,. Then 


OT = B, + at — a coth +. 
Hence 
OF,« OF =f = B, -~a Ay — 7 — coth 4, + coth T| 


= a(t, — fy) + al y(t) — x(%,) | 
= allog 3 + x(7) — x(t) ). 
The point T coincides with 7, if and only if 
(27) log 3+ x(T) — r(%) =O. 
Hence the point «1, (7 = ¢)) is the point on G, determined by the tangent from 
T,. We may say that Aj is the conjugate to A, with respect to the broken- 


extremals &,¢,, and may call our construction the Lindeléf construction® for 
the discontinuous solution. 








*Lindelof, Mathematische Annalen, vol. 2 (1870), p. 160. 
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Let, further, the tangent to G, at A, cut the z-axis in 73, and let Aj be 
the point of ©, determined by the tangent from 7j. Similarly the tangent to 
at A, cuts the z-axis in a point Tj, and Aj is the point of ©, determined 
by the tangent from 7|. As ¢, varies from f, to f,, Ay describes the curve OF 
from Aj to A,, and at the same time Aj describes ©, from A, to A}. 


6. The envelope of the broken-extremals. Let us consider the 
one-parameter family of broken-extremals (GG), 


G.: x=a+a(t—t%), y=acht, &stsh 
(28) 

OF x=%*,+a(tT+4,), y=achr, Zersh, 
with @ as the variable parameter, each of which passes through the point Ay 
and satisfies the condition 


— 1 
sh t, - . 
V3 
at the corner point 2’; (42, y,), Where (= ¢, and 7 = — ¢,. The value 4 is de- 


termined by the condition 
4, = ach 4%, 
and we suppose that 


fi, . t, =— log Ve. 


so that ul, is never coincident with the point Ag on &. 
From the above equations we obtain for the coordinates of 7: 


vy = dy + a(— log V5 — &), 
(29) i: 


» 


Yo = a, where 4, = acht,. 
V3 
Since a, and y, are continuous functions of a, 7’, describes a continuous curve 
K, which we shall call the corner curve, 

The set of catenaries ©, possesses an envelope, £, given by the second 
equation of (28) together with the following condition on their functional 
determinant : 

C(ry) _ 
G(aT) 
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The latter gives 


a asht 
C (wy) . Ny 
(30) a ig es t, — log — : cht 
eer? V4 — a° 
; Y] ) 
= asht- cotht — 7 + log3 — ty + = > 





Vii, — a?) 


=ashti y(t) — x(%) + logs! = 0. 
It therefore follows fora particular system of the set ©, ©, ©, that A(t = 4) os 
the point in which the particular catenary &, touches the envelope of the set &,. 
We shall call this also the envelope of the set of broken-ectremals (Sy ©) 


through leis 


7. The physical illustration of the critical point. A physical 
illustration is now of interest. If we construct a film between two rings as 
indicated in the introduction, the film is stable so long as the rings are sufti- 
ciently near together. Ata tixed distance, however, the film becomes un- 
stable, and divides into two separate bubbles which contract into plane 
circular films.* ‘This point of instability is indeed the conjugate point. 

We way readily obtain numerical values from our formulas. 


P Consider the longitudinal section of the film, which for 


A . : : ° . - 
. equal rings is symmetrical with respect to the point of in- 


tersection of the axis and the transverse film. Let its axis 
_X cvincide with the w-axis, and the diameter AB of one 
ring with the y-axis. Let A’ be the diameter of the 
second ring and «A,/?, that of the transverse film. Let 














‘ 


bs B (0, 1) be the point A, and (ry, y,) the point A,. Then 
FIG. 3. ° . ' 1 

(27,, 1) is the point A’. The tangents to the ares AA, 

and A,A’ at A and A’ respectively meet at 7(+,, 0). For the catenary AAg, 





y =aclhf, 
G: . 
by 


=l=achh, 


xz =a(t—4,). 








* The Goldschmidt discontinuous solution. 
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The coordinates of A, are therefore given by the equations 





4 ch¢ 
X=-—al(—t =—=-a ss 
2 ( *- ») Lh shi ty 
tz = — log V3, where l=acht. 
Hence coth 4 — 4, = log ¥3 = .5493 
and to = — 1.6292. 
It follows that 
Ps 1 3777 
~ cht, Olt, 


ty = a@ (ty — Gy) = A078, 
Yo = @ ch t, = 4543. 


From a series of four actual experiments the following coordinates for 7’, were 
obtained (see also part C, §5, below) : 


by (cm. ) Yo (cm.) Ly (Cm. ) 

3.80 1.700 1.525 

3.80 1.625 1.525 

3.80 1.345 1.525 

3.80 1.625 1.525 

average 3.80 1.630 1.325 
Reduced to scale 4, = 1 l .A2s4 OLS 
Theoretical value ] 1360 4078 


8. Necessity for a stronger form of conditions (2) and (23). 
For the envelope, /, of the set &,, the slope is dy de = shr> 0. Ilence 
# has no singular point, and the stronger form of condition (26), 


T) * bo» 


is necessary.* If ¢, = &, 7, must be eyual to @. Hence the stronger form, 


ty > G, of (23) is necessary. 





* Bolza, loc. cit., $38, p. 204. 
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9. Summary of the necessary conditions. The minimizing 
system of curves must be composed of ares of two catenaries &, &, having 
the x-axis as directrix, together with the normal ©, to the x-axis from their 
point of intersection Ay. 

&, and &, must be symmetrically situated with respect to G, in such a way 
that the three curves make angles of 120° with each other, as indicated by 
equations (8) and (12). The value of 4 must be greater than 4, where @ 
is the negative value of ¢ satisfving 

x(t) = 2 + log \3, 
or in other words, A, must lie between A, and its conjugate (in sense of $5 
above) on the curve &. Furthermore .4,; must lie between .1, and a point 
«1j determined on & by the equation 
x (7) = x(4,) - log V3. 
The point Aj is said to be the conjugate point to A, on the broken extremal 
(Gp. &).* 


B. SUFFICIENT CONDITIONS. 


We now suppose that for the extremal system ©, ©, & the conditions de- 
scribed in the last section are all satisfied. Since the problem is regular and 
since fy < tf, < Vand 0 < 7, < 7, it follows that Bolza’s conditions I, II’, III’, IV’ 
are all satistied for &, ©, E, separately. Further, under the above conditions, 
a certain vicinity of A, (a 4.), 

fy— 0, <8, Wy — by < 8, 
can be assigned in which 
p(s by) < (rz 2), 
where $(+, #2.) is the function detined in (5) above. 

Lemma. The broken-extremal @) &, is contained wholly within the 
region 22, bounded by the ordinate through A, and the 
envelope /, of the set of extremals through Ay. The 
intinite segment of &, with initial point Ay lies in Ze. 
Hence A, lies in 2). Let Ay be a point on &, sym- 
metrical to «1, with respect to the axis of &. Then 
the segment of ©, with 1, as initial point can be ob- 
tained by a lateral displacement of the are u1, % of 
€., bringing A, into coincidence with Aj, and &, is thus wholly within J2). 








Fic. 4. 


* The conditions denoted 11’, ILI’, IV’ by Bolza, are all satisfied for & and ©, separately 
under the above conditions. 
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We now construct the envelope /, of the set of catenaries through Ap, 
and the envelope F, of the set through .1,, and let S be the region common to 
the regions /2, and /?, which they define. Then A, lies in S, and & and &, 
lie in S, 











\ 
Ady 5 A 
‘ 4 
‘N Pa la, 
‘ 4 
NE @) ‘ 7 
. » 4 J 
A ne 4 F. 
a Ac ‘4 
F, as »“ 
omud X 
Fig. 5. 


We can now take 6 so small that the vicinity (6) lies entirely in S and 

we can construct a neighborhood (p) of &, and ©, also lying wholly in S. 
Let P(x, y2), be any point common to (8) and (p) and let &, &, be catena- 
ries joining /, with A, and A, respectively, and ©, and ©, any other ordinary 
curves in (p) joining 7’, with Ay and A, respectively. Let © be the normal 
from P, to the x-axis, and ©, any ordinary curve from 7; to the x-axis. 
Then 

T(&o, ©, G2) = $(ag, by), 

1( Go, ©, G2) = O(F2, Y2)s 
and hence 

I( Go, ©, Es) < L( Go, Gy, Ge). 
But I(G) < L(G), I(&,) < 1(G)), I(G,) < 1(G), 
and therefore 

(Gy, ©. G2) < TG, ©, G). 
Hence we have 

1(Go, ©, Es) < L(G, G, G). 


The conditions given in A, §9 are therefore not only necessary but sufficient 
conditions that the extremal-system Eq &, &, furnish a relative minimum with 
respect to the aduissible curves defined in A, §1, above.* 





* Our problem is similar to those considered by Carathéodory in his Géttingen Disserta- 
tion, ‘* Ueber die diskontinuirlichen Losungen in der Variationsrechnung.” The fact that we 
have three curves meeting in a point makes a separate discussion necessary. 
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(. Uxrgcexess of Extremat-Systems Witatn A Friern. 
We shall study now more carefully the region in which a discontinuous 
solution exists. 


1. The corner curve, K. We first consider the curve AY, which is 
the locus of the point 7. From (29), 


9 
n= a, 
(32 ) A: sin 
h, 
So = My + a(- log \5 - ch-! ~). 
a 
where ch~'(4, @) is positive, and 4, = —ch~'(% a). We may write the 
single equation for A, 
A 5) 9}, Po 
(32) \ : a le = i VE [en 0 —_ log \ 3]. 
' V3%, 


where the positive sign of the function ch~!(24, ¥3 ¥,) is to be taken. Here 
vy appears as asingle-valued continuous function of y, defined for the range 
O< 4, =, and sr, approaches the value “7, as 4, approaches zero. Further, 
if 4, =4,, we have r, = 1%. 
We compute from (31) the derivatives 








dr, 
dr, da \5 if = h, , hb, ) 
7—~ ——= > ,ch"'— — logy3 - — a 
dy, Uy, = | a vii, — a®) 
da 
(33) 
dy, Gry dry dPyy 
der da ‘ da; naif de ; da? he 
—- = - - = — - : ~ 0. 
dijyd ee ad da(hi ‘si a*)’ 2 
ie) 
drs . . ; —_ ; 
Hence de is a decreasing function and the curve Av is concave to the line 
L= 1%. 


dro 
Furthermore = approaches + x as ¥, approaches 0, takes the value — 2 
od I 


for 7, = 4,, and the value zero when y(f,) = log V3, where h, = a ch fy. 





he ARM. 
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This value of f, is greater than /, where y(f,) = 2 + log V3, since x (1) is 
a decreasing function, continuous except at ¢= 0, and ¢, and & are negative. 
We find approximately /, = — 1.63 and 4 = — 3.55. 

From (31) the following table, used for plotting the curve A’ in figure 6, 
is obtained ; 


fo @ v2 ¥2 

; — 3.55 L060 180 .070 
— 3.00 98 240 113 
| — 2.46 162 316 187 
— 2.00 .262 380 .302 

— 1.50 4116 ood 180 

— 1.20 06 61 642 

— 1.00 52 29: 193 

— 0.75 .t4 155 bo4 

- 0.55 S66 0.000 1.000 


As 4, decreases from — log ¥3 to — f, Ay describes the curve AT from Ay to the 
fixed point Z for which x, = .180, vy, = .070, 


ae 








Fig. 6. 
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2. The envelope &. We nowconsider the envelope / of the catena- 
ries & (a), the coordinates of which satisfy the conditions 
x=a(t—t,— log 3) + %, 
(34) BE: 
y=achr, 
where 
h,=achf,, x(t) + log 38 — x(h) = 0, 


Since y(f) is a single-valued decreasing function, continuous except at 
= 0, the equation y(?) = has for any assigned ¢ one and only one negative 
‘ solution and one and only one positive solution. ITence, in the realm 


(,< (= — log v3, log V3 =7< h, 


the values x, 7, fj, and 7 above are single-valued continuous functions of a. 





We find 
dr CT ch, 
= cothr — coth¢, + af. —- —), 
da ca ca 
dy CT 
“=ochr +ashr . 
da ca 
But 
cf, ] 
-= —  coth4,, 
da a 
ct coth* t, ct, coth® 4, ' 
ca 7 cotl? st Ca acoth? 7! 
Hence 
de coth's — coth® 4, : 


da coth?+ 


dy sh'r 
“= —— (coth*r — coth*é,) - 0, 
da ch?r 
and, as was found above, 
dy 
de 


= sh T. 


The slope of £ is therefore everywhere positive. 
Furthermore 
Py dr 


» = chr . 
dr? dy 
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Since 
wom 


Vo ‘ 
T= - — ~ + log 3 + los 


we obtain 





dt l1 «-—a“da dt, 

dx a a® dx dz 
3 

1 da 
= AL a (coth 4, + coth 7 — coth «) | 
_ [i coth' + 2 
a coth® + — coth® ‘| 
] coth®/, ) 
a coth®s — coth® ty h 
Therefore 
Py chr coth® t, KF 

dry a coth®s — coth®s, ~ ' 


Ilence / is everywhere convex to the «-axis. For the initial values, a) = 0, 


ee 


by = 1, wehave a = I/cht. Using y(4,), x(7), we compute the following table 
for points of £: 


re eee a 


ty x (fo) X(T) (7) a x y 
— 5.55 2.55 1.45 0.55 .O60 0.18 0.07 f 
' — 3.00 2.00 0.90 0.71 £098 0.24 0.11 if 
: — 2.50 1.49 0.39 0.95 162 0.42 0.24 ia 
~2.00 0.96 —O.14 1.29 2620.56 0.50 iq 
—~1.50 040 —0.65 1.73 416 9 0.89 1.20 ie 
~1.20 0.00 —1.10 2.15 556 1.252.583 ig 
~1.00 —0.31 —141 241 9 652 1.53 3.88 if 
—0.75 -—0.82 -—1.92 2.92 774 200 7.70 i 
—0.55 —1.45 — 2.55 3.99 S66 2.60 15.09 i ‘ 
7 
The curve # appears in figure 6. It touches the corner curve A’ in the if 
point L(x = .180, y = .07), and the particular curve ©, for which i i 


(9 = — log y3 in the point LZ’ (2 = 2.60, y = 15.05) where 7 = 7}. 
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3. The field of broken-extremals [(,(a), €,(a)] for which 

t, <t)=—log v3. We have now a region I (in figure 6) bounded by the 

curves &3(f) = 4) and AY, and a second region II (in figure 6) bounded by A, 

, and &. Every line parallel to the x-axis cuts the boundary of I twice 

(or not at all), once on &; and once on AY, and cuts the boundary of IL twice 

(or not at all), once on A’ or Gand once on /, for each of these curves is ex- 

pressible in the form 2 = $(v), Where @ is a single-valued function. 
‘ : ; ; : bo 

The set of extremals G,(a@), (aj < @ < a,), of the region I, where aj = che 

0 
and a, = 4\3 4,, are expressible in the form 
Z=2(Yy, @), 


where 2. 2). fas “ys Spa BFE single-valued continuous functions in the region I, 
Further 
“,= xl) — x(t) > OinT (unless ¢ = 4). 


Consider any point [2(y,a). 47) in I. If we give to y any fixed value 7; 
of I and let @ increase continuously from a’ of ©) to a, = 4 V3 y, at A, 
£( 3,4) increases continuously from 2( 7; a") to Cy, a,), and therefore passes 
once and but once through every intermediate value. Hence, if a; be any 
value of a in (a, a,) and we put ©(7;, a;) = 7, then the equation «(/, @) = 25 
has in (a’,a,) no other solution but a = a3. Through every point (75, y,) of 
I there passes, therefore one and but one extremal of the set. The region /, 
is thus a field (improper) of extremals G,(a@).* 

The set of extremals &,(a), (a, <a < a,), (29), of IL is also expressible 
in the form 


“= 2(¥, a), 


where z, LY, Las Lays Hine are single-valued continuous functions in the region 
I]. Further [see (20) ), 


rt. = X(4&) — x(T) - log 6 <O, in TI, 
By an argument similar to the one above we may show that the region II is 


simply covered by the extremals ©. 
Under these circumstances, we say that the region R, composed of the 








* Bolza, loc. cit., §19 a, b. 
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regious Land IL, constitutes a field simply covered by the broken-extremals 
(a), S\(a).* 

4. Comparison of the system (¢,, ¢,, ©) with the Gold- 
schmidt solution.} For the value of the integral over the system (G, &, &) 
we find 
‘—logig 


J (&o, Gy, &) = 2a | 


t 


chtdt + 2a @ |  olrdt + ¢ 1a’, 


log M3 


¥ 


7 o %) 
= 2 at} (sh2e + 2t) hed A log v3 


For the Goldschmidt solution, consisting of the two ordinates through A, 
aud A, together with the x-axis, the corresponding value is 


Vii - ™ (My + bi) = 4a* (ch ty = ch’ T)> 


and consequently 


(30) JG ‘ual A (So, &,, &,) 


Tv ‘ ‘ 
=>5 a*}2ch*/, + sh2¢, + 24, + 2ch?7, — sh2r7, — 27, 4+ 2 log 3/ 
= * g29 1 4.4 Ploe3 » 1 —2r,\) 

= = e1se + +e + Z2logo — (27, — —€ iyi. 


Let now Ay be tixed, and 4, be movable, on a given extremal-system. Then 
t, is fixed, and 7, variable, and we consider the function 


$(7,) = 27, —1l—e~*", 
This function has a positive derivative, 
@'(7,) = 2 + 2e“*1 > 0. 


Hence, as 7, increases from its smallest value log 45 to x, the difference (35) 
decreases continuously. On the are 1,1’ of the curve ©; we find that if 1, is 


coincident with A,, 7, will be equal to log \3 and 


AG a J (S, ,. v,) = 5 Ta- > ), 


* Compare Carathéodory, loc. cit., §11, where a similar field (fig. 14) appears for a dis- 
continuous solution of the usual kind. 
+ Goldschmidt, Prize Essay, 153v. 
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If A, is coincident with Z’, 7, will be equal to 7; = 3.55 and we have 





J, — J ,, Os ¥,) 5 ” 





the difference being approximately equal to — $(5.67)(aa*). Hence the ex- 
pression (35) vanishes at some point of &) between A, and LZ’. The totality 
of points for which this is true on the extremals ©, of the field form a con- 


tinuous curve D determined by the equations 


os amy) —-a log 3 a lo = al,, 


dD: 
y=a ch Ti» 
where 
h, =ach a 
and 
24, + 1+ eo+4+?2 low d= (27% — c_ e—?") — g. 


For the values, a, = 0, 4) = 1, we construct the following table for the 
curve D: 


‘. rT) 7 —'—log3 a g y 
— 5.0 a ee Se 16 1.10 OUS ~108 10 
—2.5 —1.79 OS 1.552 162 216 16 
—-2.0 — 78 o 1.25 262 27 28 
— 1.5 23 .t4 1.14 tli Wa a4 ) 
—1.2 So 1.01 1.11 O08 1S 7 
—1.0 1.54 1.20 1.11 652 724 1.18 
— .7o9 1.93 1.49 1.14 774 S84 1.79 
— .o0 2.45 1.73 1.18 866 1.022 2.52 


The curve D ( séé Sigure 6) divides R into two regions, one adjacent to 
Ay, in which 


Ji, — I(So, Ey, &.) 


is positive, and the other adjacent to L', in which this difference is negative. 
On D itself, J, and J (&,, &, Ex) are equal, 

_ 5. Tallqvist’s results. Experimental determination of the 
limit of stability. In the first of the two papers referred to on page Db, 
fallyvist starts from the problem to construct a minimal surface passing 
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through a given circle and meeting a given plane parallel to the circle at a given 
angle a. He derives the solution from certain formulae given bv H. A. Schwarz.” 
He then infers from physical considerations that the angle @ must be 60°, and 
finally gives the following rule for the determination of the limit of stability : 


A. 


' 











Let rp and 7, (7, > 1,) be the radii of the rings. Let Ay A, and A, A), 
be two catenaries which meet a normal Q.41, to their common directrix, at an 
angle of 60°. Let H be the projection on the directrix of the film Ay Aj, and 
h that of A, A,. Then // is determined by the condition that the tangent to 
Ay -1, at Ag(Y = 79) must pass through the foot of the normal O.A,. This de- 
termines the catenaries (see p. 15 above), and /& is determined by the point 
«1, of the catenary 4, A, whose ordinate is r;. 

This leads to the following formulas : 





275+ Vdri—3r? ry 2H 


3r \ 477, —_ 3r° ry r) 








2r,+ V4rji— ar 


—~In - 
4 3r 





from which it is found that 


” = 0.436047, 
o 


H 
= 0.407824. 
Vo 
The quotient h/rp can be found for any values ro, 7) (7) = "o)- 
In the special case where the two rings are equal, Tallqvist’s rule agrees 
exactly with our own result given in A, §7. In the general case, however, 


for the limit of stability when the two rings are unequal, Tallqvist’s rule gives 


*H. A. Schwarz, Miscellen aus dem gebiete der Minimalfachen. 








Saeaacinanaliaeesas 
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a different value from the results obtained in A, §9.* Tallqvist’s experiments 
for this case are not decisive as the two rings which he used were too nearly 
equal in size. We give below his results for unequal rings compared with his 
formulas and the results found from the formulae of the present paper, where 
b, and ¥ are the radii of the rings : 


by y x—a,= +h 
Tallqvist’s 92.455 mm 22.402 mm 18.432 
experimental values (21 trials) 1. 9OTH6 2086 
Tallqvist’s 22.455 22.402 18.206 
theoretical values i. 99766 1480 
Theoretical values from tables ) 
9766 1480 


given above j 
Since these results are inconelusive it seemed desirable to carry out a new 
series of experiments in which the two rings should be of quite different 
diameter. The experiments were conducted as follows (see figure 8) : 





























A square plate of brass 1 with a circular opening, the latter beveled to 
a knife-edge at the upper surface, was put in horizontal position by means of a 
spirit level and held rigidly in place. Above this was suspended a ring C 





* Compare also C §2, formulas (34). 





easter, 
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made from a band of heavy brass, beveled at the bottom on the inside to a 
knife-edge. This ring was attached to the arm of a stand .S. which was pro- 
vided with a screw to give smooth vertical movement and a vernier scale to 
ensure accurate measurement. 

The upper face of the brass plate was covered with a film, the ring 
lowered to it and withdrawn, At first there appeared beside the horizontal 
film across the circular opening of the plate a single catenoid extending from 
the ring to the plate. When the ring was withdrawn so far that the lower 
margin of the catenoid became as small as the circular opening, the films were 
transformed into the desired system. The height was then adjusted until the 
limit of stability was found. The diameters of the two circles were measured 
with calipers. The following are the results, where J, and D, are the 
diameters of the two rings, //7 the distance between them, or the height of 
the film : 


D, Dd, /1, 
87.7 mm DL. min 27.5 mm 
87.8 51.15 27.25 
87.4 o1.10 27.25 
87.0 51.20 27.3 
87.1 o1.10 27.4 
87.6 o1.15 27.55 
87.8 o1.15 27.4 
S7.7 o1.15 27.30 
87.0 1.1 27.55 
87.0 j1.15 27.3 

27.5 
27.5 
27.55 
27.3 
27.3 
27.35 
Average 87.52 51.14 27.35 
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Then, if 22, and 2, represent the radii of the two circles, and if we reduce 
to the scale for which /?, = 1, we have 7% = 1, 2, = .58t4, and /4 = .0244, 

This value of //, can readily be compared with that given by formulas 
(', $2 (54). We have 


a, = 0, h,, =l=ach las 


i 


KN) — XN) + log 5 =O. 
Then 


Y= osetia a ch(f)). 


/1, = a(t, —f,- log >). 


These equations have, for ¢, negative and /), positive, a unique solution, found ; 
by the method of approximation : : 
a= .2835, f = 1.933, 6 =1.3515, 2 = MH, = .6195. 

Tallqvist’s formulas give r —a, = // +h = .5795. By the formulas here given, 

the error is 0.8 7, by Tallyvist’s 7.2 (7. From these results, it may be in- 

ferred that Tallyvist’s theoretical determination of the limit of stability is 

wrong, and that the minimum problem which nature solves in the experiment 

of the two rings is actually the one which we have formulated in the intro- 

duction. 


OBERLIN CoLLEGi i 
OBERLIN, Otte. 3 


: 
' 








ON THE GEOMETRIC PROPERTIES OF QUARTIC CURVES 
POSSESSING FOURFOLD SYMMETRY WITH 
RESPECT TO A POINT* 


By R. D. CarMIcHAaEL 


Ix a previous paper ft I have considered the problem of classifying, by 
the form of their equations, plane algebraic curves possessing fourfold sym- 
metry with respect to a point. The center of symmetry being taken as the 
origin, it is there shown that all such curves are of two classes: and for 
quartic curves it readily follows that the equations of the two classes in their 
most general forms are respectively ; — 


(A) 2ay(at + y) + 4a, ey (a? — 9?) + dey ty? + 0g(0* + ) + ay = 0, 
(B) ry (ct — y') + 2r,(a2° + y*) + r, (0? — y") + 2r; ay == @, 


In this paper attention will first be given to the classification of the irre- 
ducible forms of these curves according to the number of their singularitics 
of each kind. For this purpose we shall require certain of Pliicker’s equa- 
tions. By n, m, 8, 7, p, «, we shall, as usual represent respectively the order, 
class, number of double points, number of double tangents, number of cusps, 
and number of points of inflection of the curve. The Pliicker equations are 
then the following :— 


(1) m= n(n — 1) — (26 + Sp), 

(2) n = m(m — 1) — (27 + de), 
(3) t =d5n(u — 2) — (66 + Bp), 
(4) p =sm(m — 2) — (br + Be). 


From the fact that these curves possess fourfold symmetry with respect to 
the origin it follows that singularities not at the origin can enter only by fours. 





* Read*before the American Mathematical Society, October 26, 1907. 
+ ANNALS OF MATHEMATICS, ser. 2, vol, 9, pp. 53-56; January 1908. 
(81) 
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Moreover, even at the origin if one cusp exists there must be three others 
corresponding to it; for, if not, the curve would not coincide with its former 
position after one-fourth revolution about the origin. Hence the number of 
cusps is a multiple of four, or zero. But sinee iw + 26 is positive and x is four it 
follows from (1) that p is less than four. Tlence p is equal to zero; that is, 
the curves in consideration are non-cuspidal. Now, a quartic curve has not 
tore than three double points : and therefore these in consideration can have 
no double point except at the origin. 

In group of, if «4, <0, the locus does not pass through the origin, and 
hence the curve has no double point. If a, = O and a, + 0, there is a double 
point at the origin. If both v4, = O and a, = 0, the equation becomes homo- 
geneous in rv, y and hence breaks up into linear factors. The locus is then a 
group of straight lines and is therefore excluded from further consideration. 
In group /? there is always a double point at the origin if 7, and ry are not 
both zero: and if ry =r, = 0, the equation breaks up into factors and the 
curve is reducible. If there is no double point, that is, if 6 = 0, it follows 
from equations (1) to (4) that aw = 12,4 = 24,7 = 28. If there is one double 


point, or 6 = 1, we have m = 10,2 = 18,7 = 16. Tabulating we have: 


Group ut: Case Toa, + 0 $ 12 0 O 28% 24 
(rroup wl: Case IT, a =9% i 101 0 6 18 
(iroup 2 tf oO 1 © 16 18 


Since 6 is in no case greater than one, none of these curves is unicursal. 
The tirst one belongs to Salmon’s group Land the other two to his group II.* 

We shall take up these cases separately and determine for each the 
general form of the locus and certain other geometric properties in addition 
to those already given. 


Group A, Case I. Making the substitution « = p cos @and y =p sin 8, 
and reducing to more convenient form, we lave the following polar 
equation : 


(5) (2a, + (4, — 4) sin? 20 + a, sin 40) p' + ay p? + a, = 0. 








*See Salmon-Fiedler’s Analytische Geometrie der hoheren ebenen Kurven, second ed., 
p. 278. 
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The coefficient of p* may be written in the form 





a,—a ay — a. 
‘ : 0 . » 
2) + — — + a, sin 40 + —* = cos 48, 


which by the substitution of 6 — ¢ for 6 becomes 


2a, + = = #0 +. sin 10(«, cos 4g + Ol sin 19) 
+ cos 19 (“2 * cos 1d — a, sin 19) ° 
Putting 
; r My — Us 
: tan 4d = Da, 


ce 


in the above expression and substituting in (5), we may write the result in 
the form 


(») (A, + B, sin 40) pt + Cyp? + a, = 0, 
o 4a? + (%) — a,)* 
where \ A, = 2ay + a and bB, = a sa C2) cus 4m. 
~ siete 


Since a, 4 0 we may divide by it and write the result thus: 
(A + Bsin 40) p'+ Cp? + 1=0, 


Bb U' 
I ‘ 1 
where A= » B=—, C= : 
a, 4 Uy 


If C' = 0 the equation reduces to 


(f  —1 


= + _ ° 
ee * VAG Bain 10 


The discussion is entirely analogous to that for the general equation in Case 
Il of Group A below. It is unnecessary to give it in detail. 


If C' + 0, we have 
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where r= .1+ Bsin 40. The two values of p corresponding to @ and 
@ + 180° are evidently the same, and consequently the first double sign in (7) 
may be omitted and the resulting equation will give, when @ varies from 0° to 
360°, every point of the locus which is obtained when the double sign is kept. 
Hence we may write 


_C + V0? — dr 


2r 





(3) pP=\ 


If r < 0, there is one real value of p. 

If , = 0 — *, there is a finite real value of p when C’ < 0, and an infinite 
real value of p when C' > 0. 

If r > C?/4, there is no real value of p. 

If x = C27 4, there are two coincident values of p when C' < 0, but no real 
value when (' > 0, 

If r>Obut < C24, there are two real and distinct values of p when 
CU < 0, but no real value when (> 0. 

If r= 0 +, there is a finite real value and an infinite real value of p when 
C' < 0, but no real value when C' > 0. 

Now, if B is positive, 


A—-Basars A+B. 


If B is negative it should be replaced by — ZF in the following discussion. 
A discussion of the different possible figures in terms of A, 2, C’ and hence 
in terms of the a’s offers but little difficulty. 

1) If A<— BL, ris always negative and the locus is a simple closed 
curve. 

2) If A=— BL, r is negative or zero. When r + 0 there is a sing 
finite real p for every 6. When r = 0 — there is a finite value of p if C < | 
and the locus is a closed curve. If ( > 0, the curve has infinite branches, 
of which the asymptotes are given by r = 0. 


le 
Be 


3) If B> A> — Br has both positive and negative as well as zero 
values. If C’> 0 we have a single branch in each quadrant, corresponding to 
a negative r, and approaching as asymptotes the lines 7 = 0. 
correspond to values of @ for which r is positive. 


No real points 








* That is, approaches zero through negative values. 
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That the lines = 0 are asymptotes of the curve in this case and in the 
preceding case may be proved by showing that the distance of a variable point 
(p, 9) of the curve from the line r = 0, which is p sin (a — 0), where a is the 
value of @ for which r = 0, approaches zero as @ approaches a. We have 


A+ Bsin 40=r, i+ DB sin da = 0, 


r= B (sin 46 — sin 4a) = 22 sin 2(0— a) cos 2(0 + a). 


Then p sin (a — @) approaches zero as [sin (a — @)}!, unless cos 2a = 0 or 
B=0. If B=0 we have also A= 0 and the curve is excluded from our 
consideration, being of the second degree; if cos 2a = 0, we have A = 0 and 
sin da = 0, so that r= Bsin 40. It may be seen that this case is no excep- 
tion, but that here also p cos (a — @) approaches zero as [sin (a — @)}!. 

To discuss the form of the curve when C’ is negative, consider first a 
value of @ for which r is negative, and let 6 change through 90° so that 7 in- 
creases to zero, becomes positive, then decreases to zero, becomes negative, 
and returns to its original value. There is at first one value of p remaining 
finite as 7 = 0 —; when r increases through zero there is a finite value and an 
infinite value (7 = 0 +), so that the finite branch is continued and a second 
branch has the line r = 0 as an asymptote. That r= 0 gives an asymptote 
may be proved as before. If r remains less than C?/4, these branches remain 
distinct, the infinite branch having a second asymptote when r+ = 0 4, so that 
the locus is a closed curve and four infinite branches like these of a hyperbola. 
If, however, r becomes greater than C?/4, the two branches come together 
aud have a common tangent from the origin for r= CC? 4, and for values 
greater than C?/4 no real values of p. If uf + B=C?/4, so that C? 4 is the 
maximum value of r, we should apparently have a double point, not at the 
origin, and so contradictory to our previous result. It may, however, be easily 
seen that in this case the locus is not irreducible, but consists of two hyper- 
bolas. It is of course not proved that the curve is irreducible in other cases. 

4) If A = B, r is equal to or greater than zero. If C > 0, there is no 
real locus. If C < 0, there are two distinct values of p, one becoming infinite 
with » = 0, and the locus is as in the previous case where A + B< C?/4. 

5) Finally, if A> B,r>0. When C>0, the curve has no real 
locus. When C < 0, we have two finite values of p, always distinct unless 
A—Bs 0?/45 A+B, If C*/4 lies outside of these limits the locus 
consists of two separate closed curves, each surrounding the origin ; if between 
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A— Band A + B, the curve consists of four separate closed curves, each 
excluding the origin, whose tangents from the origin are given by r= C? 4; 
if finally C2 4 is equal to .f— Bor A + B, the locus reduces to two hyper- 
bolas, 


Group A, Case II. Putting 7, = 0 in equation (6), and dividing by 
— (\, we may write as the polar equation of the curves to be considered here, 


(%) (44+ Bsin 10) p'— p? = 0, 


where the values of .1.and Bare easily found in terms of the a's. From (‘) 
follows 


(10) p* — 0), 


(11) PN 44 Bsin 40’ 


Corresponding to the values of p given by (10) is simply the origin, an iso- 
lated point. In (11) the double sign has been omitted for the same reason as 
that given in the discussion leading to (8). 

We have seen that this locus has 18 points of inflection. Moreover, those 
not at the origin enter by fours. Hence the isolated point at the origin must 
also count as at least two points of inflection. If there are more than two 
points of inflection at the origin there must be six, so as to leave those not at 
the origin a multiple of four. But six coincident points on a quartic is impos- 
sible; and therefore the origin counts as just two points of inflection. 

Again setting r = -f + 2 sin 40, it will not be difficult to discuss the 
locus of (11) in terms of .fand Z. We have as before 


A~Ba va 4 + SS. 


if B is positive, with the same modification as before if B is negative. 

1) It A Ss — B, there is no real locus. 

2) If B> A> — Br has both positive and negative values. Negative 
values give no real points. Positive values of r give the locus consisting of 
four branches, each like that of a hyperbola and having two asyinptotes given 
byr=0. 


3) If A= B, the locus consists, as in the previous case, of four branches 
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extending to infinity, each having an asymptote common with the next 
so that the asymptotes are along two perpendicular lines. 

4) Finally, if .l > 2, we have r > 0, and there is then a single closed 
branch. 


Group B. Transforming the equation by the substitution « = p cos @ 
and y = p sin @, and reducing, we have: 
(7 cos 20 + r, sin 20) p' + (r, cos 26 + 1, sin 20) pe = 0. 
Replacing 6 by 6 — ¢ and setting 


r 
. ») ae 0 
tan 20 = A, 

ry 


the equation takes the form 
p'A, sin 20 + p*( B, sin 20 + C, cos 20) = 0, 
where 
A, = ry sin 26 + 1, cos 29. 
I}, = rz sin 26 + ry; cos 2¢, 
(', = r, cos 2h — ry sin 2¢, 
If A; = 0 this equation is of the second degree and need not be con- 
sidered. If A, 4 0 we may replace the equation by 
p' sin 20 + p?(B sin 20 + Ceos 20) = 0, 
where B= BA, and C = C, A). 
This equation is equivalent to the two 
p? = 0, 
p==\ ~ B- Cetn2é. 


The first of these equations gives only the origin. The second isa circle, 
real or imaginary, if C’ is zero. If (@ is not zero the locus consists of two 
real branches, each having an inflexion at the origin and approaching both 
ends of one of the coordinate axes asymptotically. 


ANNISTON, ALA., 
OcToOBER, 1907. 
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EXTENSION OF THE SIEVE OF ERATOSTHENES TO 
ARITHMETICAL PROGRESSIONS AND 
APPLICATIONS * 


By J. C. Mornenkap 


AtmMost without exception the methods employed in the elaboration of 
the various existing factor-tables are modifications, more or less remote, of 
the classic Sieve of Eratosthenes. In most instances these factor-tables are 
designed to show the prime numbers and the prime factors of the composite 
numbers occurring in sets of consecutive integers.t The Sieve method, as 
applied to the determination of the primes in a set of consecutive integers 


9 ») 
”?, TL ened Re m+ by i+ OD, + + + We M, 


may be briefly described as follows: We strike out from the numbers of the 
set the multiples of 2: from the remaining numbers of the set the multiples 
of 5; then, from those vet remaining, the multiples of 5, and so continue, 
for the multiples of all primes not greater than y (# + ”). The numbers finally 
remaining in the set are all prime. If, as the procedure just described is 
earried out, the multiples of 2,5, 5, +--+ are conveniently indicated, we ob- 
tain what is called a factor-thle for the set of numbers considered. 

The present paper describes an analogous method applied to successive 
numbers in arithmetical progression. The discussion in §§1-5 is quite 
general, while the remaining part of the paper is oceupied with the discussion 
vf applications and special cases. 

1. Extension of the Sieve method. Any arithmetical progression 
of m integral terms may be written, in direct or inverse order, in either of 
the forms 


(1) a’ +- h, Dak - /,, hat + 4, : 2 2 mark + h, 
(1') —-@+6, —~$e +46, — ga? +5... — mgt +B, 


*Kead before the American Mathematical Society, Apvil 29, 1905; December 29, 1905. 

+ The Factor-Tables of J. Burckhardt, Z. Dase and J. Glaisher applying to all integers 
under 9000000, their recent extension by L. D. Lehmer, under the auspices of the Carnegie 
Institution, Washington, D. C,, to 10000000, and the tables of “High Primes” of A. Cun- 
ningham and H.J. Woodall are the principal tables of the sort that have been satisfactorily 
verified. 
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if suitable integral values are assigned to the symbols, «, b, k, a. The fol- 
lowing general discussion may be limited, therefore, to progressions of the 
forms (1) and (1’). Evidently-we may assume, without loss of generality, 
thit a is positive and / positive or zero. We shall further assume, unless 
the contrary is expressly stated, that 4 is prime to a*;* for each term in (1) 
and (1') would admit the common divisor of 4 and a* as a factor, — a case 
which it is useless to include in a search for prime numbers or in the formation 
of a factor-table. 

The methods and results obtained in this paper have for basis and starting 
point the following 

THeoreM. Let p be any prime number that does not divide a®, and let the 


least non-negative solution of the congruence 


rae 


4h=0 mod p, 
for assigned values of a, b and k. he 


= = “hp 


Then only the 
(Xp)™, (pt ta”, (2p + Zay)™, s+ (pia, + a 
numbers in (1) are divisihle hy p and only the 
(p = ay)™, (2p oes a,,)", (3p P t.,)™, _— ( pu}, _ Zay)™ 


numbers in (1') ave divisible by p, ity and My being the maximum integers 
such that 
pm, + Vkps pm, — Xp = im.t 


As here defined, x,, is clearly a function of 4, p, «and 4. This will oc- 
casionally be indicated by writing .“,,,,. However, when it is believed that 
no ambiguity or confusion will arise the symbol will always be abbreviated to 

5 A 
Typ) OY EVEN Fy. 

The above theorem leads at once to the following 
* An exception is made in the application of the relation (.W) in §11. 

+ From well known properties of linear congruences it follows that no solution 7,, exists 
when p is a factor of a‘; but for all other prime values of p an unique solution exists such that 
OS", Sp — 1, and that all other solutions are represented by 


eos ap + My. 2 = t 1, 2, SS,°°*. 


The validity of the theorem is thus immediately rendered apparent. 
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EXTENSION OF THE Steve oF Eratostuenes. Let a be the greatest prime 
not greater than YM, where M is the maximuin numerical value of the numbers 
in (1) and (1') for an assigned set of values of a,b, k and m, Let the values 
of x, be known for all prime values of pus fur as m, Then to determine 
the primes in (1) we strike out from (1) the 


(te). (2+ 2p)", (2% 24+ ey). (38 & 24+ ae), + + + (2m + Tye)" 
numbers, which are multiples of 2+ then the 
(t.3)"", (8+ ey)", (2 % 3 4+ ag). (3 XK B+ ayy), + + + (Sang + 2g)" 


wumbers, which are multiples of  E- the a the 


(z,,)", (5 re x,.)", (2 x os ee Sun y™., (3 4 4 \ t.,)™, es (5m, 4. Xp,)™ 


numbers, which ave divisthle hy 5+ and so continue, until finally awe strike 


out from the remaining nuwhers the 


(2 _y™. for 2 ey, (2er i ay)", (ier L >. yth eee. ae x,,)™ 


nunbers, which are neltiples of 72 the wniubers vow remaining ave the prunes 
occurring in (1). 


Similarly, hy determine the proues tn (1), we strele out from ( 1’) the 


2 — 2)", - > + (2my — Ze)" 


x 
(s—sz,,)", (2X 3 —apy)", (3 x 3 — ey)", - - - (Simg — 24)" 


a Fee at (2r — £,,)™", (37 — ie is (Tim. — Ze, y™ 


numbers, and those finally remind in ( 1’) are prove. 

If, as this PLOCESS ws carried out, the wadtiples of 2,454,595, 7, +++ are 
tudicated, ive obtain a fuctor-lable for the PVOGVESSIONS (1) and (>. 

When ya, fry. fn. 0 fp, are Known, this “sifting” is essentially iden- 
tical with the ordinary Sieve. In order, therefore, to render this extension 
of the Sieve method in any degree practicable, it is necessary to devise means 
of calculating expeditiously the numbers “,, tor stwceessive primes and for 
ussigned values of a, band hk. 


2. The calculation of ,,. Connection with residue tables. 
The choice of the most convenient means of computing Lips t. @., Of solving 
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the congruence 

(A) xa* + 4 = 0 mod p, 

depends, in general, on the relative values of 4, a and p. The direct solution 
of (.1), even for small p and for the few cases where residue tables are 
available giving the least remainders (mod ») of a’, is quite tedious. How- 
ever, when such tables are at hand, we express the solution of( A) in the form 


4 
i = — ba’-* = — 6a"-* mod / 


B ti, =- 
( ) hy a 
where v is the principal exponent* of a (mod p), and /’ is the least positive 
remainder of 4 (mod v), take the least remainder, /?, of a’—*' from the tables, 
and compute directly 7,, as the least positive remainder (mod p) of — 622. 
When 4 = + 1, we may thus read the successive values of ,, directly from 
the residue tables.t Thus for a=6, b=—1, p=13, k=, we have 
x, = 6%—-8§ = 7 mod 13. 

3. Calculation of x,,, from «,. First method. In many cases, 
especially when series for different values of 4, but fixed values of a and 4, 
are under investigation, it is found expedient to employ one of the following 
methods of deriving 7,4), from 7,.,, 7,42,,. from #4 ),,, and so on, be- 
ginning with a known : Setting & = 0 in (74), we obtain 


(C) ty = — b mod p. 


Hence xo, is in every case known or readily determined, and theretore the cal- 
culation of 2, for successive values of 4 may be begun with ‘9. 
Replacing « by +,, and writing (.1) in the form 


x). 
: at! a4 4 = 0 mod Ds 


7 
we have at once 
a) v; + YP 
(/)) p41, = = f mod }), 
a a 
* The minimum integer, ». such that avy = 1 mod  p. 


+ The only printed residue table of suflicient extent to be of use in this connection is 
A Binary Canon, by Lt.-Col. A. Cunningham. This table gives the least positive remainders of 
2° for x = 1, 2,3,°°* », and for all primes under 1000.) An extension of the table (as yet in 
MS.), by Col. Cunningham and Mr. I, J. Woodall, includes ail primes under 10000 for 4. 100, 
and all primes under 12000 for z S 36. Hence when a = 2, p < 12000, and & is such that » — ¢y 
falls within the range of values of « just stated, the Binary Canon is directly applicable to the 
computation of x,,. 
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where + is an arbitrary integer. Since a and pare mutually prime, an unique 
choice of from the numbers 0, 1, 2,---@—1 is possible, such that 
(x, + yp) a is integral and less than p. When y is so chosen,* we may 
write instead of (1) 


, ep J UP 





For example, when a = 2, y = 0 or 1 according as sz, is even or odd; 
and therefore the computation of x,,, from «, involves simply division by 2 
when 2, is even, or the addition of p and then division by 2 when «, is odd. 


Thus, for as?, h =] and p = 37, X= Ob: “= 36 2 = 18: Le = 9; <3 = 


(a — 57) 2—23: y= (25 + ot) 2 = 30: ete. 
~ Again, when @ = 10, y may be chosen at sight so that the final digit of 
yp + x, will be zero.t Thus, for ¢ = 10,4 = 3, and p = 43, we have zy = 40; 
m= 435 m= (442 x 43)/10=9; 7,= (94+ 7 X& 43)/10 = 31; ete. 
4. Calculation of «,., from «,. Second method. For brevity, 
set E&, = yp, und &, = +,,,~,. The following relations are always satisfied, 
£,, &, and +, referring to the same « and p. 


(F’) Spa, = — Eo, ) 
(F") Troi = Sr 

(fr ) ign =A(— &)"% 
(4) teen = Er, 


» mod p. 
(I) , 


(//’) 


eth = E,2, 
“poh = Ente 

(J) p+ nn = (— Ey)"% 
(J°) 





° " tl ne 
Ak +nh = E, can 


* The selection of y involves the solution of the congruence yp + * = O moda, the diffi- 
culty of which, compared with the labor of solving (A 
relative values of a4,,andk. The application of (EF 
large, which is not the case with the solution of (A). 
is clearly the preferable method. 


directly, will generally depend on the 
is essentially the same for & small or 
Also, if p> a, the application of (F) 


+ In the case a = 10 the use of ordinary commercial adding machines greatly facilitates 
the de ivationof +, _, from +,; for the process reduces to the addition of p to +, repeated until 
the tinal digit of the sum is zero, and then dropping the cipher. ‘This procedure may be moditied 
80 as to be applicable to certain other special cases. 
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Evidently (7), (G@) and (//) are contained in (J), and (#''), (G’) and 
(//') are contained in (7’). All, however, are useful in computation, and 
are written together for convenient reference. 

The demonstration of (7) and (.J') establishes the validity of the six 
preceding relations as well. From the definitions of x, and &, we have 


(¢) Vy 


—ha-* mod p, 
(7 ) — &, =a~" mod p. 
Multiplying (¢) by (y) n times, member for member, we obtain 


— ba~**+") mod p. 


(— E, ) "Zy. 


But by definition 
Ve 4 nh = — hy—(k + nh mod p; 
therefore 
Tr 4 nh = (— &)"% mod p. 
The demonstration of (.J') is similar. 

The computation of 7,4), % 42+ OF %,4nx from x, by means of the above 
relations involves only the dérect calculation of the least positive remainder 
(mod p) of a product. Thus, for p = 41, a = 14 and 4 = 5, we have, from 
(C’) and (£), 

& = 1, & = (1 + {1) l4 = 3; Xy = 36, 
and therefore, by (F’), 
4, =3 X 36 = 26 ) 
t, =3 X 26 = 37 
r,; = 3 x 37 = 29 } mod di. 


m4=3xX W= 5 





Thus (F’) and (F’’) are admirably adapted to the compution of x, for succes- 
sive values of &, yielding a method generally practicable. For a few special 
values of «, however, the method of §3 involves less reckoning and is there- 
fore preferable. Any one of the formulas (G), (G’), ---(J') may be em- 
ployed in calculating 2, for non-consecutive values of 4, — in practice, that 
one is selected with which the smallest & or £’ may be used. Applied in this 
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way at intervals, the congruences (G), (G"), +++ (-J') are especially valua- 
ble as check formulas. 


5. Relation between -;,,,,, and 7,,,_,. For brevity, we write 
Te =" epon ANd FE = Mppq—» By definition 
ry +h = 0 mod p, 
rat —b = 0 mod p. 
By addition, we obtain 
(r, + rp) a® = O mod p: 
hence 


r, + 7, = 0 mod p, 
and since z,. 7; < p, We have at once 
(Av ) 4, +2, =P, 


except when «, = 2; = 0, ¢.e., when 4 is divisible by p. This exceptional 
case, b divisible by p, requires no computation, and may therefore be excluded 
from discussion in this and the following section. * 

If 4 is replaced by — 4, formula (C’) becomes 


(C") Sey x 6 mod p, 


which could have been inferred also from (Av). It follows as an immediate 
consequence of the definition of .;, that the same methods and rules that 
apply to the computation of x7, apply equally well to the computation of .r;, 
starting from the initial value given by (C’). 

When one of the numbers «,, 7; is known, the other may be obtained by 
the relation (A): however, when a table of successive values of x7, and 2} 
together is being formed, as illustrated in §6, (AU) is employed most effect- 
ively as a check formula. 


6. Calculation of 7, and +, when a= 2. As has been noted in 
§4, the calculation of +, ., from #, by means of formula (2) is extremely 
simple when @ = 2, involving only division by 2 when x, is even, and the 





* The relation ( ) is contained implicitly in the theorem of §1, a8 may easily be seen if 
we note that the numbers in the series (1) and (1'), with the signs changed throughout, refer 
to—b. In fact, the 2, forthe numbers in (1'), with the opposite signs, is simply p — 7, i.6., te 








ae 
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addition of p and division by 2 when 2, is odd. Even greater simplicity is 
attained when the computation of successive values of x, and x}, is performed 
simultaneously by a method based on the relations :* 
(L) T+ Te4. = pay, When x, is odd, 

/ ' . 
(L') Te + Mp4, = Tea 1, When x} is odd. 
We shall demonstrate only (), as the demonstrations of (L) and (Z’) are 


similar in form. 
By definition of x,, we have 


r.a* +h = 0 mod p, 
and multiplying by 2, 


(/) v,a' +! + 2b = 0 mod p. 


Also, by definition of 2}. 4}, 
(kh) v4 ,0*+! —~b6 =0 mod p. 
Adding (/) and (k’), we obtain 


(7, + ep 4,)°*! + b= 0 mod p, 
and therefore 


(7) ay - r; 2 = Zh 1 mod p- 
In keeping with the conventions stated in $1, » = 2 is excluded when a = 2, 
and hence it follows from (/v ) that one of the numbers ,, 7; is even and the 
other odd. Let x, be odd; then x}, is even and 7,4, = (2,)/2. But, from 
(W), 2, + 2, =p, and hence w, + +.) = 7, + (7,)/2 <p. We may there- 
fore, when x, is odd, replace the congruence (/) by the equation 

e+ "ee = Meee 


The formulas (1), (Z') and (F) yield the following compact method for 
the calculation of successive values of x, and x; when a = 2. 








* (L) and (L’) are valid only fora =2. The generalizations take the forms 


(m) r, +(a-— 1 rr r=.) mod ;), 


’ 
ri, modp, 


(m') af + (a — 1)z, - 
respectively. These congruences are not especia'ly useful in computation when @ | 2, since 
when one of the numbers x, | ,, gv! _, isknown, the other can be found from (JT ) more readily 


than from (m) or (m’). 
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The numbers x, and 2) are known and one is even; hence one of the 
numbers .,, «; can be obtained by a single direct division by 2, and the other 
by the application of (L) or (L'), involving the addition of two numbers, 
each of which is less than p: then one of the numbers 2,, 7; can be obtained 
by a single direct division by 2, and the other by the application of (/) or 


’ ’ 


(L'): and so on, for ay, yi ey ryt 

The compactness of the method and the speed it is possible to attain in 
the reckoning are perhaps best understood in connection with a few examples 
arranged in tabular form as follows. The number heading each column rep- 
resents /, and the upper and lower numbers of the pairs in the columns rep- 
resent the corresponding values of ., and x; respectively. 


k= Oo; 1 3 3 $ |] 5 6 7 8 9 10 


oe 67 ) a 60 i) 30 7 1b = i ot 27 17 a7 62 31 19 
b= Th = 7 37,, 52) 2% 13 mW 2 10 5; 26; 18 
p = 16231) ‘= 1G104 8052. 4026) 20130 9122 4561 10596 3519S) 25909 9415 12823 
b= 127 fat = 127) S179 12205 (14218 7109 11670) 9835 11053 136832 GSI6 S108 
a a, ue : 30; 215: 323! 377 404; 22 101) 266 133, 282; Mi 
b= ifaf= 1, 216 108| 5! 27) 2200-330, 165 OSD, 290 


In the above table, the numbers in the rows x, or the rows a} that are 
obtained by division by 2 are sufliciently evident. As illustrations of the 


application of () and (Z'), we have, for p = 67andh = 7, #7, =7 + 380=37; 


% = 30+ 15> 52; = 15 + 206 = 41; = 41+ 13 = 54: ete. For g= 
16231. and4 = 127, re = 41501 + 5835 = 10596; ae =—- DSH + 5198 = 11033: 
etc. 


As illustrated in the above cases, we always have, when « = 2 and + is 
numerically less than p, 2, = (p —“)/2 and zj = (p + 4)/2, 

In the manner indicated one may, after a little practice, compute a table 
of the kind indicated above almost as rapidly as the numbers it contains can 
be written down. For the single division by 2, or else the single addition, 
by which each of the numbers that are to be entered in the table are obtained, 


may be performed mentally as the numbers are recorded. Furthermore, when 
xz, and x; are calculated by this means, the formula (A) yields an independent 
and effective check, and one that can be applied at a glance. When the ar- 
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| rangement of the table is as above, for example, the sum of the two numbers 
in cach rectangle gives the corresponding prime. However, it is unnecessary 
to check each pair, 2,, 2, but it is sufficient to apply the test at intervals of 
a half score terms or more. If ordinary care is exercised, a table thus com- 
puted and checked needs no further verification. 


7. A table of values of «, and «’. for «= 2.* The author has 
undertaken the elaboration of a table of values of a, and #), fora = 2,6 = 1, 
A= 1, 2,5,... 100, and for all primes under 100000. This table is now 
under way. A table of x, alone for / = 1, 2, -- + 19 and for all primes un- 
der 25000 was computedt by the method of §3, but the formation of the 
larger table for both +, and .x;, was subsequently begun anew by the more rapid 
method of §6. The arrangement of the table is that shown in §6, except that 
the tirst column is headed p and contains in order the successive primes 3, 5, 
7, 11, +++ and the column for / = 0 is omitted. 

The table, when completed, will be applicable to the determination of 
primes and the formation of factor-tables of numbers of the forms x2 + 1 up 
to 10", and to the determination of all factors under 1OQ000 of numbers of 
these forms beyond 10", for / = 100, 

8. An application of the table. We pass now to the application 
of the table to the determination of the primes in the two arithmetical series : 


(2) 241, 202%) 41, 3(2%) 41,--- 9 121102) +1, 


(2) —2%4+1, —2(2%) +1, —5(2") +1, ---—1211(2") + 1. 


If we include 1 with the numbers in (2) and (2'), we may write instead of the 
two series the single arithmetical series : 
(2") —1211(2") + 1, 1210(2") + 1, - 

41, 1, 2%+1, 2(2%) + 1,--- 1211(2%) +1. 


Writing down in order the coetlicients of 2" 


(Il) —1211, — 1210, —1209,.. 
3, = 2, ~ 1,6, §,3.2,-~- 1908, 1280, BSE. 


. we obtain 


* A paper of Mr. L. L. Dines, to be published in the April number of the ANNALs, con- 
tains a detailed discussion of the case a = 6 and ) = 1, with various applications and the de- 
scription of a table he has elaborated. 

+ By the author and an assistant, Miss C. Ames, in April, 1905. This table is included in 
the appendix of the author's thesis on Fermat's numbers and numbers of the forms 2‘g = 1” 
on tile in the Yale University Library. 
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Referring to the column headed 19 in the table, we find 


. a . a a - —— 
Vio 3 = l, 19,5 = 3, “19,7 = Bs “ig 1 = a. > 6 


Striking out from the numbers in (II), the positive and negative multiples 
of 3, increased by 1, —namely, 1, 4, 7, 10, 13, -- - 1201, 1204, 1207, 1210; 
—?2,—5, —8, —11, — 14, - - - — 1202, — 1205, — 1208, — 1211; then from 
the numbers remaining in (IL) the multiples of 5, increased by 3, —namely, 
3, 8, 18, --- 1203, 1208; —7, — 22, ---— 1207; then from those remain- 
ing the multiples of 7, increased by 3, —namely, 17, 24, --- 1200; —4, 
—18, +--+ —1194; and so continuing, for all primes under y }1211(2")+ 1{, 
the numbers finally remaining in (II), except 0, are the coefficients of the 
prime numbers that occur in the series (2"), or, which amounts to the same, 
in (2) and (2’). 

The author has carried out the work for this series as indicated, and has 
found that the following numbers in (II), and these only, correspond to 
primes in (2") : 


— 1198, — 1195, — 1186, — 1164, — 1156, — 1143, — 1141, — 1154, — 1129, — 1126, — 1125, — 1123 
— 1108, — 1104, — 1095, — 1090, — 1084, — 1085, — 1060, — 1045, — 1044, — 1039, — 1030, — 1021 
—978, —973, —958, £956, —925, —924, —919, —910, +894, —S888, — 885, — 883 
— 868, —3853, —850, —S828, —s80t, —S8Ol, —793, — 778, —759, —754, — 748, — 745 
— 735, —733, —730, —726, — 705, — 700, —696, —685, —664, —663, — 654, + 6O 
— #03, —601, —600, —595, —579, —556, —510, —534, —520, —493, — 484, — 463 
—460, —456, —444, —441, —421, —415, —414, +411, — 408, —400, — 394, — 391 


+ 369, —363, — 360, — 339, +324, —315, —313, —306, —303, —285, — 268, — 265 


+ 264, —24]1, —234, —229, —223, —213, —208, —199, —153, —150, — 139, — 133 
—115, —114, —103, — 99, — 93. — 91, —- 76, — 75, — 73, — 70, + 69, — 61 
—5l, —36, —234. — I, 
‘3, 14. 26, Fe 41, 44, 50, 5A, a7, x G69, +t, 77 
31, 132, 14, 147, 156, 159, te, 11, 14, 197, 200, 212 
216, aah, 225, 2, 252, 260, + 24, 270, 282, 205, 302, 309 
$12, B20, + BPA. BAS, 341, DG, 365, + 369, BS0, B90, 404, + 411 
417, $14, $22, $35, 440), $47, 45), 407, 470, 476, 455, 489 
49%, 497, 51s. o24, 5d, 5A, 55D, 5s2, 585, 594, + 609, 627 
629, 630, G41, Oe, OF1, OSG, HOU, 714, 71M, 720, 732, 737 
740, 747, 40a; 762, GF 782, 2 705, SOM, 810, S19, 827 
Bor, 854, 875, S76, 87%, BS4, + SUM. Sti, 929, + 936, 942, 915 
951, 987, YS), 1001, 1ol4, 1019, 1081, 14g, 1059, 1077, 1089, 1094 


1100, 1110, 1115, 1127, 1136, L140, 1154, 1155, 1161, 1167, 1169, 1181 


1145. 


The numbers above that are marked with a double sign, as + 936, oecur 
in both the negative and positive coetlicients. Whenever such is the case, 
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the corresponding numbers in (2”), as + 936(2") + 1, differ numerically by 
2, and compose what is called a ‘‘ prime pair.” There are eight such pairs 
occurring in (2"). 

After the table had been made, the time occupied in applying it to the 
determination of the primes occurring in (2) and (2') was about eight 
hours. This is a shorter period than would be required in multiplying out 


the 2422 numbers in (2) and (2’) and consulting factor-tables like those of 


Burckhardt, Dase and Glaisher, if such factor-tables were extended as far as 
634912769, the largest number considered. All the numbers in (2) and (2') 
in which the coefficient of 2" is numerically greater than 19 lie beyond 107 
and hence beyond the range of the factor-tables. 

In the above case, while the numbers in (2”) range in numerical value 
from 524287 to 634912769, the “sifting” process is applied simply to the 
coefticients of 2", a set of consecutive integers ranging from — 1211 to +1211. 
The power of the general method of this paper is due to just this feature. 
After a table of x,, of sufficient extent to meet the requirements of the case 
in hand has been computed, the sifting is applied not directly to the numbers 
under investigation, but to a range of smaller consecutive integers, the coeffi- 
cients of a’. Thus is obtained a method, not difficult to apply, for the deter- 
mination of the primes and the factorization of the composite numbers of a 
given form xa* + 6, where the investigation of each number separately would 
be quite impracticable. 

9. Application to the factors of Fermat’s Numbers. As is 
well known, all factors of Fermat’s Numbers,* /’, = 2?) + 1, must be of the 
form 

O,=q- 2*t* + 1. 
All the known factors of F', were identified by testing the divisibility of F', 
by primes of the form @,. The primes of the form g.2' + 1, given in §8, 
were tested as possible Fermat factorst for g = 11, 14, 26, - + - 989, with the 
result that no Fermat factors were found in addition to those already known. 
Ilence, for nv > 16, there are no more factors of /', in addition to those pre- 
viously known, under 1001.2" 4 1, or 524812289, 


* F, is known to be prime for n = 0, 1, 2, 3, 4, and composite for 2 = 5, 6, 7,9, 11, 12, 
18, 23. 36, 38, 73. See Proceedings of the London Mathematical Society, ser. 2, vol. 1, p. 175; 
vol. 3, p. Xxt; vol. 5, p. 237; Messenger of Mathematics, vol. 37. p. 65; Bulletin of the American 
Mathematical Society, vol. 11, p. 343; vol. 12, p. 449. 
+ J, e., factors of Fermat's numbers. 





a Se meee een 











-_ 





100 MOREHEAD (January 


By making use of the 55™ and 75™ columns of the table described in §7, 
in its present extent, it was found that neither 5.2 4+ 1 nor 5.2 + 1 admit 
factors under 2000. Subsequent tests showed that both are prime, and that 
5.254 Lisa factor of /’;. The last result was obtained by the following 
method. 


Setting p= 5. 2+ 1, we have 


5.2% +4 1 = 0 mod p, 
and hence, using the notation of §4, 


— a & »11 
E64 = Ley =. 2". 


Then applying (/7) we obtain successively 


&,. = — &, = — 5° . 2% mod p 
f= — 5st, 2H 
f= — 3*. 28 = 5, 24 


En. = 5°. 10%, 


From this stage on, the numerically least remainder (mod p) of each &» was 
found by direct division by p. This process showed finally &-. = 1, and there- 
fore that 27° + 1 is divisible by p. Thus the identification of the Fermat 
factor 5.2° + L was effected sulely by the methods of the present paper,* and 
less arduously, as far as &.., than by the usual method of calculating succes- 
sively the remainders (mod p) of 27, 2%, ... 2°, 

A more detailed report on the application of the methods of this paper 
to Fermat's factors will be held, for the sake of completeness, until the table 
of §7 is further advanced. 


10. Connection with Lebon’s table. Professor Lebon, in a paper 
published in the Lulletin of the American Mathematical Society, vol. 13, 
p. 74, describes a process for the construction of a table applicable to the fac- 
torization of the numbers in arithmetical progressions in which the common 
difference is the simple product of consecutive primes, 2.3.5. +++ p,, and 


the first term is any one of the (2.3.5. .+.p,) numbers less than and 


* This method is stated briefly in different form in Bulletin of the American Mathemati- 
cal Society, vol. 12, p. 237. 
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prime to the common difference. The common difference and first term, 
called the base and index of the progression, are represented by B and J re- 
spectively. The system of $(#) arithmetical progressions whese general 
term is BA + J, where AV is successively equal to the positive integers starting 
from zero, and J is prime to #B, contains all prime numbers except 
2,3, 5,-++ p,, and all composite numbers except multiples of 2, 5.5, . ++ p,. 
The numbers recorded in the table are the minimum values of A’ that 
render BA + T divisible by each successive prime following p,.* Evidently 
a table so constructed is applicable to the decomposition of, or the deter- 
mination of the primality of successive numbers in progressions of this special 
form, as far as p; (the last prime included in the table), or to the inves- 
tigation of the divisibility of isolated numbers in such progressions, in a 
manner similar to that described in the present paper. In fact, if we take 
a= B,b=T1 and fk =1, the numbers +, occurring in my table in the first 
column following the primes coincide, for the primes following p,, with the 
minimum values of Av recorded in Professor Lebon’s table. If the latter 
table is extended to include the ¢( 2) values of J, the complete table is 
clearly applicable to successive integers, aside from multiples of 2, 3, - - + pz, 
and hence applicable to the formation of factor-tables of the usual form. 

However, the method of Professor Lebon is not so easily and directly 
applicable as is the method of the present paper to the investigation of the 
divisibility of numbers of the form awa* + 4 when k& exceeds unity. Thus the 
special form of the numbers i.2/ + 1, treated in §§8, 4%, would be of no ad- 
vantage if the table of Professor Lebon were to be applied. 

1l. Application to ranges of consecutive integers. The method 
of this paper may be extended so as to be applicable to a range of consecutive 
integers without restricting the form of the progressions as in §10. Let a 
and & be fixed. Then, like the Lebon progressions, the system of $(a*) 
arithmetical progressions represented by me* + 6, where m = 0, 1,2, -- + and 
4 assumes successively the 6(a") positive integral values Jess than and prime 
to a*, contains all positive integers that are prime to a. Consequently, a table 
of values of x,» for the primes as far as 7 and for the ¢(a*) values of / would 
suffice, like the Lebon table, for the decomposition, or the determination of 
the primality of all numbers prime to @ as far as 7, 











*A table compiled by Professor Lebon has been published under the title Table de 
caractéristiques relatives & la base 2310 des facteurs premiers d'un nombre inférior a 30030, 
Paris, 1906. 
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The efficiency of such a table, as well as the amount of labor involved 
in the compilation, will be determined by our choice of a and 4. To fix ideas, 
suppose it is desired to construct a table effective to 108. A convenient 
choice would be « = 10, 4% = 4. Then the system of $(10'), or 4000, pro- 
gressions of 10000 terms each, whose general term is m.10* + 4, contains 
every odd number except multiples of 5, from 1 to 10°— 1. To be effective 
as fur as 10°, the corresponding table of values of 74, should be calculated 
for 1226 primes, viz.. 7, 11, 13, -- + 9973.* The following is a convenient 
arrangement of the table. Here 2,,,,9, stands at the intersection of the row p 
with the column /. 








p=i ’ 1| o| 8 . > |. 
41 10 8 rs : 2 Al tne . oF 
y97 432. 299 38) B97. | 956 | 690 


To illustrate the method of applying the table, suppose we wish to 
decompose the number 26840237, = NV, say. It is easily seen that 3 is nota 
factor of Ve. Writing .V = 2684 x 10¢ + 237, and referring to the column 
headed 237, we find 

2084 2 2 mod 7 


+ 


+ 5 mod 11 


= 690 mod 997, 


Here 690 is the first remainder obtained on dividing 2684 by a prime that 
coincides with the corresponding z,,. Hence 997 is the east factor of LN. 
The remaining factor, 26921, is less than 997? and therefore prime. Thus we 
have the complete decomposition, V = 997 > 26921. If forthe primes as far 








* It is useless to include 3, as divisibility by 3 is easily tested. 
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as NV no remainder were found to coincide with the corresponding <,,, then 
would WN be prime. 

The application of the table would be greatly facilitated if a residue table 
were available, giving the least positive remainders of each integer as far as 
10 for the prime moduli 7, 11, . - - 9973. If the arrangement of the residue 
table were similar to that of the above table, then to effect the decomposition 
of the number 26840237, for example, we should need only to compare cor- 
responding numbers in the columns headed 2684 and 237 in the residue and 
factorization tables respectively, until a coincidence is discovered at p = 997. 

The application of the above table has one evident advantage over that of 
Professor Lebon, covering the same range of numbers. We recognize at 
sight the coefficients required to express a number in the form m.10* + b; 
while a division must be performed to express a number in the form 
m(2.5.-+++p,) +4 preliminary to applying the Lebon table. Aside from 
this feature, the two tables differ little in efficiency when a residue table is 
available with each. 

The labor of compilation is not so great as to render the task impractic- 
able. Starting from the value of x) given by (C), §3, we calculate succes- 
sively, for given values of 4 and p, ©, 72, 73 and finally x, by the method of 
§3. Aided by the use of calculating machines, a computer of average speed 
‘an evaluate by this method and record about one hundred and twenty 2,’s 
per hour, when the corresponding primes contain not more than four places. 
The labor may be further reduced by means of a relation which we shall now 
derive. 

Let us write, for brevity, x, for 74,;,. We have, by definition, 


10. vr, + 4 = 0 mod p 


and 
104 . i ;,: oe id 


0 mod p. 
Whence 

LO (0, + 4) + b + b' = mod p, 
and therefore 


(.V ) Soran = Lot Uy mod p- 


When the numbers in column 1 have been found by the method of §3, 
repeated applications of (.V) will give all the remaining columns. However, 
the relation (.W) can be used most effectively if two auxiliary columns have 





i¢ 

j! 
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been calculated, viz., for 6 = 2, 4.* Since two successive values of 4% in 
the table ditfer by 2 or 4, the addition of , or 1, to any «, and the sub- 
traction of p if the sum exceeds p — 1 will give the corresponding 2, 4 , or %, 4 , 
in the next column. Thus for p = 997, we have wy = 864, a = 73 
Hy = Xz + Hy = 33 + BO4 = SMC z Togz = wy, + 4 956 4+ 731 = 690 mod p. 

If a calculating machine is used, a computer can evaluate by means of (1 ) 
and record from four to five hundred «,’s per hour, for primes of four places 
or less. The table may be checked by applying the method of §3 to every 
tenth or twelfth column.f We may estimate, therefore, that a single com- 
puter would require four to five years for calculating, recording and checking 
the x,’s for the entire table of 4900 columns and 1226 primes. 

The generalization of the relations developed in this paper for composite 
moduli, and the incidental application of the methods of deriving .;,,,, and 
2}, to the calculation of power residues are only remotely connected with 


the purpose of this paper and will be reserved for later publication, 


NORTHWESTERN UNIVERSITY, 
Evanston, ILL, 
APRIL, 1908 


* Here} is not prime to ¢, so that these two columns should not be included in the table. 
It is easily seen that (.W) is valid for / = 2, 4. 

+ It is evident from the definitions that 7, = 7, ,. Hence the numbers in any row of 
the table recur in exact order at intervals of 10p in the values of 4. For primes under 1000 
this property facilitates the calculation to some extent. 

: The manuscript factor-tables of Professor Kulik, left in charge of the Vienna Academy, 
extend to 100330201, —a little beyond the proposed limit of the factorization table here de- 
scribed. However, the Kulik tables are at present almost useless on account of the many errors 
they contain. See Bulletin of the Ainerican Mathematical Society, vol. 14. p. 106. 











A METHOD OF INVESTIGATING NUMBERS OF THE 
FORMS 6¢%.s + 1.* 


By Lioyp L. Drives 


In a paper published in the January issue of the Annas, Dr. Morehead 
indicated a method for determining the factors of numbers of the form a‘. s + 6 
where a, i and « are positive integers and 6 is a positive or negative integer. 
He elaborated the case when a = 2, = +1, and considered briefly the case 
whena=10. The present paper will describe in some detail the method as 
applied to the case when «=6,4=41, and will exhibit some results 
obtained. 


1. The Functions +, and x,. Let us consider the congruences 
6.2 —1= 0 mod p, (1) 


H*.2+ 1=0 mod p, (2) 


when / is any positive integer and p is any prime greater than 3. [It is 
evident that (1) and (2) would have no solution for p = 2 or 3.] Solutions 
of each of these congruences exist, and are functions of p and’. For any 
particular value of p, we shall call the least positive solution of (1) 2,, and 
the least positive solution of (2) 7;. From the linear character of the con- 
gruences (1) and (2), it follows that the functions +, and 2x; are uniquely 
determined, each being less than p. 

These functions possess a number of interesting properties, any or all of 
which may be used as a basis of computation of the functions. We will state 
these properties without proof, merely making reference at the right-hand side 
of the page to the general theorems stated and proven in Dr. Morehead’s 
paper. 


2. Properties of x, and -;. 


1°) t+ X= ~p- (§: “ie 
2°) p+ hy = Me pe mod p. ($4, / 
3°) 1 °% = thes mod p. ($4, 2 
4°) bes r= Teter mod a ($4, oli 





* Read be fore the ( ‘hicas xo Sec oe of the American Mathematical Society, eri 17,1 1908. 


(105) 
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») When p is of the form tn — 1, 
(7) 2, = nv, mod p, ($4, F’) 
(4) a, 4, = wry mod p. 
6) When p és of the form tn + 1, 
(¢) t.4, = nvj mod p, ($4, F’’) 
(4) #6.) = ux, mod p. 
3. Method of Computing +, and x). For any tixed value of p, 
and for 4 = 1, the values of x, and «j are easily determined by the follow- 


; . , , {pri 
ing method. We have identically 6 (A ) = 1 mod p. 
‘ ) 


p+ 


Therefore x, = 
) 


mod p, using the ordinary definition of fractional 


, ae . ‘ike ’ »>— | 
congruences when » + 1 is not divisible by 6. Similarly 6 Lert — | 


h 
»— | a »+ 1 p—1., 
l —— mod p. Kither / om / . is integral. 


b a) TB) 


>+ | ) l 
! ., and from property 1°, 7; = p— i . Inthe 


, b 
F —_ 1 ane | 
latter case x) = ! rc and 7, =p — A i 
) ’ 


mod p, and therefore x; = 


In the former case qx 


After 7, and x; have been thus determined, the values of +, and x for 
consecutive values of / can be computed by means of the following recursion 
formulas, 


* 


in each of which » has some one of the values 0, 1, 2, 3, 4, 5:- 


- ~~ gigi 

(A) %41= ae 
;, r+ up 

(8B) ae, = 7. 


) 


The determination in each case of the value of n necessary to make 


de: - np 


- WP an integer less than p, is facilitated by the following simple 
‘ 


: a 


Theorem: ft vy, is odd, n will be 1.3. or 5.) If Wy is even, n will be 2 or 4. 


For, in order to be divisible by 6, x, + “p must be even. If x, is odd but 


*Cf. ANNALS or Matuematics, vol. 10, p. 92 (1909). 








| 
} 
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x, + np is even, then must wp also be odd, and hence n must be odd, i. e., 1, 
3, or 5. Ifa, is even, then must np also be even, and hence » must be even, 
i. e., 2 or 4. 

Formulas (A) and (2) afford the easiest method of obtaining the functions 
x, and x,, while the properties of section 2 can be very profitably used for 
checking the computation. The following table will show the functions x, and 
x, of four consecutive primes for values of & from 1 to 12: 











o h - » oo | + 2 2 - L eleic = 
: Hi/oiadoulod old oed ol ol play 
Be “2 |} = ~ ~ ~ “~< | «e | « | « ae “2 
__| a |131) 48| 8 106) 70| 64 63] 89) 41 33) B4/ 14 
toT) at g6 109/149, 51) 87! 93) 94] 68/116 124) 73/143 
ate BG TT) 40 GL 146) 133 158135 LOL 126) 21) 85 
oes x, | 27 86/123) 102; 17; 30 5| 28) 59) 37/142] 73 

ry, 28 116) 75 96) 16 114 19] 31) 33. 89 154137 
16 1139) 51] 92) 71/151) 53/148 136/134) 78] 13} 30 

rp | 29 149 169 57) 96 1G LIB 135 109-47 15283 
a! M4 24) 4 116) 77.157, 55) 38) 64.126) 21) 90 








4. Quadratic Characters of », and +). In the following section 
we shall show how these functions «, and 2; can be paired off into two sets, 
sich that one set consists of some or all of the quadratic residues, while the 
other set consists of some or all of the quadratic non-residues of any prime 
greater than 3. 


l. (duadratic Character of Wee 
We have by definition, 6%«, + 1= 0 mod p. Hence 


2, =_ FAL mod Is 


) 


@-(2)-6)-GOO  « 


and 
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From the Theory of Quadratic Residues, we know 


when p= 4m+1, 
(— —=)=* - when p= 44+ 3; 


2 +1 when p= 8m+1, 
(5) ~ —1 when p= &8mi3; 


(-) _ +1 when p=12m+1, 


when p= 12m+5 
From these formulas and equations (3), we obtain the followng results : 


24m +1 ) 


or re Ea + 1)(41)*(41)' = + 1. 


When p=: 
24m + 5 | P 


When p= 


.()=- G0! (41) =—- 1. 
Pp 


(#)- (= 1)(41)(= 1) » {t+ 1 for k& odd. 


Wher »=< oor 
en 7 (— 1 for & even. 


(24m + 


] 
) 
ins +7 | 
uy 


( 24m —7 


‘ ¢ 1 for & even. 
When i or ar (=)= + 1)(+1 + I at 
/ | . p ) y"( )" ~|—1 for k odd. 
24m — 11) 


2. Quadratic Character of xy. 
We have by definition, 64”, = 1 mod p. Hence 


q _— ] | 
x). = ra moc Ps 


GarG-o®: 





Snr 
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From (4) we get the following results :— 


24m +1 ) 


When p= ‘ (“)= (+ 1)*(41)*= +1. 


hed +5 


24m — 5 


dae + 7 


When 7 ()= (+ 1)(F1)F= ft+l for & even. 


24m + 12 ad jm 1 for k odd. 


240 —7 


, (*)= (4 1)'(F1)'= ie for / even. 
"ay 


When vite ‘ | | , (=)= (+ 1)*(4 1) = 41. 
awa 
at : 
r-| 


When 


Py - — 1 for & odd. 


We can summarize these results in the following convenient form :— 


24 + 1 
( j = XEN 
(a) When p bea as mf t) GS )=+ 


24m 


(b) When p= 914i — l, (> ‘) = (> mt) = — 1 4 
Sim ~ 5, (“) = <= j, (=) a Me 
P p 


(c) When p 


aged 1 for & odd. 
V4 = xy f+ or £0 
«) — .G “t) = | {— 1 for & even. 
hina +11 
(= ” (+ 1 for & even. 
=) ~ |—1 for k odd. 
24m — 7 


(e) When p 


+1 for k even. 
“t) = (= -{* 1 for *& odd. 
lai 
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As an illustration of the application of these results, we will find all the 
quadratic residues and non-residues of 67. 
The values of 2, and .. for the prime 67 and for 4 = 1, 2, --- 33, are: 


k 123s 45 6 7 8 910 11 12 13 14 15 16 17 


x, 11 13 58 32 50 53 20 48) 8 46 30 5 12 2 45 41°18 


x, o8 D4 OY 35 17 14 47 19 59 21 37 62 55 65 22 26 49 


k 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 
2} 3.54 28 27 58 51 420 7 57 45 65 44 52 31 61 66 


x) 64 35 30 40 29 16 25 60 10 240 4-23 15 36 61 


Now 67 = 24m — 5. Therefore from (Cc), the quadratic residues are: 


aa, 39, 40, 29, 16, 25, 60, 10, 24, 4, 23, 15, 36, 6, 1. 
The quadratic non-residues are : 


33. 13, 58, 32, 50). 53. 2), 4s, S, 46, 30) d. ‘2. a £5. Pil. 18 » 34, 


98, 27, 38, 51, 42, 7, 57, 43, 63, 44, 52, 31, G1, 66. 


An important use of the residues and non-residues obtained in the above 
manner is found in their application to the factorization of single numbers of 
the form 6’s + 1. 


5. Application of Functions ,, and -; to the Factorization 
of “ Ranges” of Numbers of the Forms ‘s+ 1. 9 Consider any 
number of the form 6*s + 1. 9 The necessary and sutlicient condition that this 
number be composite is that there be a number p, such that the congruence 
64s + 1=0 mod p be satisfied. In other words, the problem of factoring 
this number reduces to the problem of finding a prime number p, less than 
the square root of 6’s + 1, such that the 2; function of this number is either 
equal to, or congruent to s, modulo p. In the same way, to factor any 
number 6s — 1, it is necessary and suflicient to tind a prime number p_ less 
than the square root of 6’s — 1, such that its 7, function is equal to, or con- 
gruent to s, modulus yp. If such a number does not exist, in either the first 
case or the second the number under consideration is prime. 
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A table* has been computed by the author, which shows the functions 
x, and .}, of all primes from 5 up to 10,000, for i = 1, 2,3... 12. By atable 
with such limits, it is possible to investigate completely all numbers of the 
forms 64s 4 1 (4 = 12) up to 100,000,000 and to obtain all factors under 
10,000 of numbers of these forms over 100,000,000. The power of the table 
is better realized when we consider that we now have no factor tables over 
10,000,000, 

The table is most profitably used to investigate ‘*ranges” of numbers of 
the forms 6*s 4 1 for any fixed value of / and for a series of consecutive 
values of s. The method used is entirely analagous to the method of the 
Sieve of Eratosthenes for determining prime numbers in general. 

We first write down the values of s between the limits considered. 
Then, beginning at the first of the table, we look in the column of the partie- 
ular value of & which is being investigated. If we are considering the form 
(i's — 1, we look only at the x, functions. If our form is 6"s + 1, we look 
only at the x; functions. If the x, (or 7;,) for the first prime, 5, is among 
the values of s which we have written down, we cross it out, together with all 
numbers congruent to it modulo 5. All numbers with these values of s are 
divisible by 5, because by the definition of x, (or 7,), this value of s will 
satisfy the congruence 6's = 1 = 0 mod 5. Then if the x, (or #/) of the next 
prime, 7, occurs among the values of s, we cross it out together with all con- 
gruent numbers modulo 7. All numbers having these values of s are divisi- 
ble by 7. Continuing in this way, we cross out all the remaining values of 8 
which oceur as x, (or «;) functions together with numbers which are congruent 
for the corresponding modulus. All numbers having these values of s are 
divisible by the corresponding modulus p. This process need be carried on 
only until p reaches the value \ 6's ¢ 1, where s has its maximum value in the 
‘¢range,” since a number is prime if it has no factor less than its square root. 
For all values of s remaining after this process, 6’s — 1 (or 6’s + 1) is 
prime. 

We will now give a few results obtained in some ‘‘ranges” by this 
method, 

Runge 1. 68s 4.1. s = 200, 201, -- - 300, and 2150, 2151, -- - 2250. 

The numbers are prime for the following values of s: 207, 215, 215, 225, 


256, 243, 270, 282, 295. 





— 





*In manuscript at the Library of Northwestern University. 
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The numbers corresponding to the following values of s have no factors 
under 10,000: 
2150, 2165, 2166, 2170, 2171, 2175, 2190, 2203, 2225, 2226, 2227, 2241, 
2242, 2247. 

These numbers lie between 9,351,201, and 104,976,001. 


Range o §%s — I. s= 200, 201, -- - 300, and 2150, 2151, - - - 2250. 


The numbers are prime for the following values of s; 200, 205, 214, 217, 
224, 234, 238, 239, 269, 288, 293, 204. 

The numbers corresponding to the following values of s have no factors 
under 10,000: 
2163, 2168, 2172, 2180, 2184, 2189, 2205, 220%, 2 


2250. 


~ 


212, 2214, 2217, 2240, 


These numbers lie between 9,531,199, and 104,975,999. 
Ranged. 6's +1. a2=1,2,.--.- 400. 
The numbers are prime for the following values of s: 
3, 10, 17, 18, 25, 33, 45, 47, 52, 53, 60, 62, 75, 83, 86, 100, 110, 112, 


121, 126, 131, 133, 138, 140, 145, 150, 163, 165, 170, 171, 172, 182. 186, 


187, 200, 215, 226, 237, 241, 242, 255, 256, 272, 273, 276, 280, 283, 236, 


-s =- - 


293, 300, 303, 307, 313, 335, 341, 347, 348, 353. 


The numbers corresponding to the following values of s have no factors 
under 10,000; 
561, 371, 385, 391, 396, 398, 


These numbers lie between 279,937 and LLI,{74,401- 
Range k. 6% — I, s=1,2,--- 400. 


The numbers are prime for the following values of s: 


9, 33, 39, 49, 53, 64, 65, 75, 78, 82, 98, 105, 108, 124, 127, 133, 134, 
2 


42, 15 


1 , 163, 168, 175, 79, 185, 187, 189, 199, 203, 213, 214, 219, 
224 : 


. 225, 229 7, 278, 285, 287, 297, 


9 


298, 308, 310, 312, 31! 





aE i 
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The numbers corresponding to the following values of s have no factors 
under 10,000: 
362, 364, 369, 375, 3X3, 390, 394, 400. 


These numbers lie between 279,935, and 111,974,399. 
Range 5. 68s + 1. s=1,2,--.- 100. 
The numbers are prime for the following values of s : 
3, 10, 21, 23, 25, 31, 46, 50, 58. 


The numbers corresponding to the following values of s have no factors 
under 10,000: 
66, 67, 72, 87, 93. 


These numbers lie between 1,679,617 and 167,961,601. 
Range. 6%s — 1. s=1,2,--- 100. 
The numbers are prime for the following values of s: 
13, 18, 28, 30, 40, 55, 59. 


The numbers corresponding to the following values of s have no factors 


under 10,000 : 
65, 70, 73, 79, 83, 95. 


These numbers le between 1,679,615, and 167,961,599. 
Range 1. et fi. s=1,2,--- 100. 


The numbers corresponding to the following values of s have no factors 
under 10,000 ; 
10, 11, 12, 26, 47, 48, 51, 53, 55, 62, 65, GR, 72, 73, 81, 87, M1, 97, 8, 100. 


These numbers lie between 10,077,697, and 1,007,769,601. 


Range 8. 6's — 1. San EO. «oo 10, 


The value x = 3 gives a prime. 
The numbers represented by the following values of s have no factors 
under 10,000: 


19, 35, 38, 43, 53, 70, 75, BO, 85, 89, 98, 
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These numbers lie between 10,077,695 and 1,007, 769,599. 
Reonge®. 6s +1. s=1, 2,--- 100. 


The numbers represented by the following values of s have no factors 
under 10,000: 


2, 8, 12, 18, 28, 31, 52, 53, 56, 58, 70, 71, 95. 
These numbers lie between 60,466,177 and 6,046,617,601. 
Range 10. 6s — 1. s=1,2,.--. 100. 
The numbers represented by the following values of s have no factors 
under 10,000, 
1%, 24, 30,37, 58, 44, 53, 58, 60, BS, 69, 70, 72, 87, 94. 
These numbers lie between 60,466,175, and 6,046,617,599, 


The numbers which have been designated as having no factors under 
10,000 have been spoken of thus because the range of the table used is not 
great enough to determine absolutely that they are prime. Probably most of 
them are prime. 

A number of very interesting observations can be made from the few 
results which we have tabulated. Notice, for instance, the consecutive 
values of s which give primes (or for those above 100,000,000 probably 
primes) :— 


Range 1: 2165, 2166; 2170, 2171; 2225, 2226, 2227; 2241, 2242. 
Range 2: 238, 239; 293, 294. 


Range 3: 17,18; 52,53: 170, 171, 172: 186, 187; 241, 242; 255, 256; 


Range 4: 64, 65; 133, 154: 215, 214; 233, 234: 277, 278: 207, 298. 
Range 5: 66, 67. 
tange 6: 39, 40, 


Range 


Range 9: 52, 53; 70, 71. 


Range 10: 37, 38; 68, 69, 70. 
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More interesting than these, perhaps, are the “prime pairs” (primes 
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differing by 2), which are found in these ranges. 


These occur, of course, whenever any value of s makes both 6*s + 1 and 
The following “prime pairs” (those over 100,000,000 prob- 


6*s — 1 prime. 


ably prime) are found in the above ranges :— 


68 
6° 
) 
= 


0° 


6! 


oe. 
, 
(10 . 


(10 3 


The numbers of the first pair mentioned, 1. e.,"102, 783,167 and 102,783, 
160° fall above one hundred million, the limit of the table above described, but 
they have been further tested and found to be prime. 
constitute the largest prime pair known at present. 

In conclusion we would call attention to the advantages of this method in 
determining prime numbers in general. 
of the forms 6s + 1, if we take 4 = 1, in the method described above, and 
let s have consecutive values, all primes may be computed by making use of 
the first column of the table. The advantages of this over the ordinary “sieve” 
method are that the numbers to be handled are very much smaller, and only 


33 + 
53 4+ 
7) + 
133 4 
163 4 


187 + 


98 + 
53 
5B 4 


70 4 


- 2203 41 


] 
1 
l 


102,783,167 


9,237,887 
14,836,607 
20,995,199 
37,251,487 
45,629,567 
52,348,051 


71,383,679 


534,117,887 
987,614,207 
3,204, 707,327 
3,507 ,038,207 


4,232,632,319 


one third as many have to be examined, 


NORTHWESTERN UNIVERSITY, 





EVANSTON, ILL. 


and 
and 
and 
and 
and 
and 
and 
and 
and 
and 
and 
and 


and 





102,783,169 
9,237,889 
14,836,609 
20,995,201 
37,231,489 
45,029,569 
52,348,033 
71,383,681 


534,117,889 
987,614,209 
3,204,707,329 
3,507 ,038,209 


4,252,052,321 


They are believed to 


Since all primes greater than 3 are 





THE SOLUTION OF ALGEBRAIC EQUATIONS BY PARTIAL 
DIFFERENTIAL EQUATIONS 


By H. A. Sayre 


1. Let the algebraic equation of degree n in z be 


27+ nzz+ ny =U. 


Let the solution of this equation be z = F(x, y). With the usual notation for 


partial derivatives with respect to x and y, we have 


2 1 
P=—p-14 2” =~ 7-14 2° 
2zx —(n —3)z" xz —(n— 2)z"-! 


we (271 +x)3 (2"-! L x)? 9 


(a — 1)2"-# 
ake Rear 
(s*~* + “)° 





a 





= (z*-! + x) 


8 (9n — 2n? — 9)2**—-?2 + (n? + Sn — 8)2"*-' ae — 22? 





(2" = l aike sy 


_ _ (A — 1)2"—7)} (4 — 2n)z"—1 + (un + 1)! 





(2" 1 + xy ’ 


5- (n—1)(1— 2n)zin—* + (wn —1)(u — 2)xr2"-3 
(2"—' + x) ee 





Among these derivatives there exist relations such as 
P-—gqz=%, 
rt — s* — pot + q?s = 0, 


p22? 4+ qr4+1=0, 


z= (n—1) pr ngqy. 


(122 — 2n? — 16)2°"—! + 2%r(n? + bin — 19)— 622? 





,’ 
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When » is assigned it may be possible to form a differential equation, into 
whose solution z and the arbitrary functions enter in such a way, that when the 
algebraic equation is considered as a particular solution, the integral equation 
reduces to 

z— F(z, y) =9, 


where F’ is a known function. If so, the algebraic equation is solved. 


2. Solution of the Quadratic. For x = 2, pz"-? + qz+1=0, 
reduces to 
Ptqe+1=0. 


Lagrange’s subsidiary equations are 

dx dy dz 

[ ss «=f 
Two integrals are 

xv? — 2y = constant, z+ x= constant: 
hence the general integral is 
z=—2+ $(2*-2y). 
¢@ must now be determined so as to make 
2422+ 2y=0 

a particular solution. 


1°. In the quadratic, when x = 0, we get 
z=+V —2y, hence $(— 2y) =+V — 2y, 
which determines the form of ¢. The integral equation now reduces to 
z= —2xtVx? — 2y, 


Which is the solution of the quadratic. 

2°, Substituting = = — 2 + in the quadratic, we get $? = 2? — 2y. 
This is the well known solution in which the second term is removed by a 
linear substitution. 

3°. The integral equation suggests changing the variables from x and y 
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to x and v, where v = 2? — 2y. The quadratic becomes 


24+ Qer + 2? = v, hence z+ 7 = 4 Ve. 


This is the common method of completing the square. 
Among the differential equations which may be used for solving the quad- 
ratic are the following : 


» (= 2= (4) + ya? — dy. 

2. rxet+sx?—-p—-1=0, z=—2 + $(2?— 2y) + (y). 
3. s8+tx=0, z= v(x) + $(2? — 2y). 

4. Be+ ya? —s=), z= O(r) + v(s? — 2y) + Oy). 


3. Solution of the Cubic. For the cubic we have r+ le=0. 
When xn = 3, 2 = (nu — 1) pe + ngy becomes 
z= 2pe+ 3yy, 


whence p = 2rxe + 2v + 3sy. 


Multiplying r+ ¢t2 =0 by «, and subtracting from the preceding equation, 
we get 
ret+ dsy — tz? + p=. 


To integrate this equation let us use Monge’s method. The subsidiary 
equations are 
dy _ dy + Voy? + 428 


du 2x 
43 
2d ( pr) _— ———. —_—___—__— dy = ), ' 
BY ANID Y? + 4x5 ' 
from which we have 


— sy + V97? + 422° = 4, 
2pe —(—3y + VY? + det)g = ey 


on OY = Voy? + 43 = C3, 


Ype —(— By — V97 + 4e3)q = &. 





sone 
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The intermediate integrals are 


2px — (— dy — V9¥? + 4x*) 7 = O (— By — Vy? + 4z4). 


Putting «= — 3y, v= V9? + 423), we get 





ha re i wy, Bi -8s, 
dav op Ys 

or 

de = }fu+ v] O,(u— v)—(u—v) O(u +r) ) (rdv — udu) 





“Gui? — wt) 


du 





= 0,(u -_— ”) _ 6, (4 + 1) 
6v 
Ee 6,(4 — vr) 


——d(w—r) + 


O,(u + v) 
~ 6(u —v) ¢ d(u+v). 


6(u + v) 
The integral is 
z= o(u+v) + v(u—r) 
= $(— By + VIP FB) + Y(— By - VOPR + Ee), 
where ¢ and y are arbitrary functions. 
Putting — 3y + V9y + 423 = 2m, — By — V9? + 42° = 2”, the general 
integral becomes 
z= $1(%) + Wil"), 
while the cubic transforms into 
2 — 32 \ UY} — (4 +7,;) =90. 


The initial values for the cubic are 


uy vy z 
0 0) 0 
0 Vv) Vey 


0 
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Substituting these values in the general integral, 


0 = (0) + ¥(9), 
Ve, = (0) + ¥i(")s 


Vig = iC) t Wil). 


By addition 


yar "1 = O)(%) + Wil") 3 
hence 


z= VG + V4 
restoring the original variables, 


* ay + Vig a i 


3/_ 3Yy ~% a? + 423 
-) ities ae . are 


) 





= 3 e . ixsihle ; 
Since Vi, 4 = — “> the admissible roots are 


2=VM t+ VM, 
qe z= VV tetyry, 

naw Vag4 w Ve, 

where 1+ wi? =O, 
4. Solution of the Biquadratic. For the biquadratic 
z= 500 + 4qy, p=sre + Sp + dsy. 

Put A= 3rrd4ey+2p=0, Ba=dar+ 4 By tbr =0, Caa+ br=0. 
Then 


Be — Cx? — A = 2ax* — br + ABry + 2re— Asy — 2p =0 


is the equation that will be considered. Changing the variables from x and 
Sz 
=, and 


bE; 
o§; 
¢ 


_, Will vanish if 


6&; 


y to &, and &,, the coefficients of 





ORT DN 


a 


+a. 








1909] THE SOLUTION OF ALGEBRAIC EQUATIONS 


where m, and m, are roots of the cubic equation 


‘ 4, 2? 
hig Se eas = on 





< oe 
m= > \ re 








; 3 /| x x ys 2 x? 
e = © “+e — —* 1 oe — 
a ee ee 
The differential equation now becomes 


a3 a 


C 5 bay 


CECE, OE, CE 


0, 
and has for its solution 
z= $(£:) + ¥(&) + (Es), 
where &,, &,, & are roots of the cubic equation 
fp ye T=0. 


The biquadratic may be written 


yp 
27° 
x y? 


~ N64 277 


zs + Sze VE,F&; — 4(&, &, + £,&, + E,&,) aos 0, 


where e& = 1 and ¢ has the sign of x. 


Permuting &, &, &; leaves the biquadratic unchanged, hence 


z= $(&,) + o(&) + $(§)- 
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Substituting this value of z in the biquadratic, we have 
zt —2 [67(&,) + $(&) + O7(&)] 2 — 8h6(E1) O(E) O(&) 2 
+ [H°(E1) + H°(E2) + H7( Es) }? 
— 4[G7(E,)G7(E,) + $°(E,)67(E3) + $7(E;)67(E,) ] = 0. 
Identifying this equation with the biquadratic, 
$°(E1) + H°(E) + $7(Es) = 0. 
(£1) $(E2) O(Es) =— € VEL ELE, 
$°(E1) H°(E2) + HE) $7(Es) + H(E;) G7(E,) = EE, + Geb + Gb, 


together with &£+8&4+8&=0. 
Hence 


$(&,) = « VEL, 6(E:) = €, VE, (&;) = € VEs, 
where ¢j = €'= €; = 1, and e,e,€, = — €. 


The solution of the biyuadratic is therefore 


z=, VE, + €, VE, + €; VE;. 


UNIVERSITY OF ALABAMA, 
UNIVERSITY, ALA. 
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THE IN- AND-CIRCUMSCRIBED QUADRILATERAL 
By W. E. Byerty 


In 1792 Fuss* showed that if the radii r and J? of two circles and the 
distance a between their centres satisfy the condition 


((22 + a)? — r*)[(2 -—a)?— 2?) = 9r,~ [1] 


a quadrilateral can be drawn that will be inscribed in the latter and cir- 
cumscribed about the former. 

Six years later Fusst carried his investigation further and obtained the 
conditions that certain polygons of more than four sides could be inscribed 
in one of two circles and circumscribed about the other. 

In 1822 Poncelet in his Proprietés Projectives proved a theorem that 
greatly increased the value of Fuss’s results:—“If a polygon of n sides be 
inscribed in one conic and circumscribed about another an infinite number 
of polygons of n sides can be inscribed in the first conic and circumscribed 
about the second, and moreover if from any point on the circumscribed 
conic a tangent be drawn to the inscribed conic, from the point where 
this tangent again meets the circumscribed conic a second tangent be 
drawn, and this process be continued, the nth tangent will intersect the first 
on the circumscribed conic and a closed polygon of x sides will be formed.” 

In 1860 Mentiont took up the problem of the polygon of x sides in- 
scribed in a circle and circumscribed about a second circle and solved it 
for values of x from 3 to 11, giving as the condition for the case x = 4 


(R+r+a)(R+r—a)(h-r+a)(l—-r—a) = 0 (IT) 


l 

¥i 

The problem of the In- and-Cireumscribed Polygon, or as he calls it the 
Porism of the In- and-Circumscribed Polygon seems to have greatly interested 
Cayley who wrote several papers§ on the subject taking up the problem for 





* Nova Acta Petrop. 1792. 

t Nova Acta Petrop. 1798. 

+ Bulletin de V Acad. de St. Petersburg. 1860. 

§ Cayley’s Collected Works, vols. I, If, and IV, passim. 
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two conics and dealing with two circles as a special case. In one of these 
papers* he parallels Mention’s work for values of n from 3 to 9. 
For the quadrilateral Cayley’s condition is 





_ ; 4 Ip? — Pet F? — OR? 
R?—a [= + rt + at — 20? — 2a Re 2 le 1|= 0. IIT] 


Fuss’s condition [I] and Mention’s condition [II] both reduce to 


RR + at — 2a??? — 2A? — 2h = 0 LIV} 


‘ 


and are identical. Cayley’s condition [III reduces to 
(2? — a?) (RR + at — 202? — 2AT? — 2?) = 0 [Vv] 


and includes the older ones together with the new condition 7? — a? = 0. 

In the papers to which I have referred, the quadrilateral problem has been 
treated incidentally and without detail. I wish to consider it’ independently 
and from a decidedly elementary point of view. 

Let the eqjuations of the inscribed and circumscribed circles be 
respectively 


x Se — yr, (1) 
(x—ayi+ y= 2, (2) 


and suppose that a quadrilateral can be inscribed in (2) and circumscribed 
about (1), 

Then by Poncelet’s Theorem we can start at any point of (2) and draw 
tangents to (1), from the points where these tangents meet (2) a second time 
we can draw tangents to (1) and they will intersect on (2): and we shall have 
a quadrilateral inscribed in (2) and circumseribed about (1). 

If we take as our starting point (@ + /?, 0) and draw our quadrilateral 
as indicated above, it is evident from considerations of symmetry that the last 
vertex formed will lie on the Y axis and therefore that it must be one of the 
two points in which (2) meets the VY axis, i. e. («@ — 2, 0) or (a + &, 0). 
Cuse 1. Suppose the last vertex is at (a — 22,0). Then each of the 
tangents from (@ + J?, 0) will form with a tangent from (« — 2, 0) an angle 
inscribed in a semi-circle and therefore a right angle. 





*Cayley’s Collected Works, vol. IV, p. 292, et seq, 
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In Figures 1 and 2 the right triangles O7'C and O7"B are equal. Hence 
OC=\a—R|. 
In the right triangle AOC the legs are a + R and |v — d?| and the 

















Fig. 1. Fic. 2. 


distance from the vertex of the right angle to the hypothenuse is r. 


Then pa (at R)y( a—R ) ; 
Via + 2)? + (a — R)* 
1 l | l wn 


| a r v1 
ol yr? (a j Rh)? (uw ao I)? L J 


This condition (VI), which is equivalent to [1] and [II], the conditions 
obtained by Fuss and Mention, is therefore a necessary condition. That 
it is sufficient is easily shown as follows :—In figures 1 and 2 through uf draw 
the tangent A7' to the circle (1) and from 2 drop a perpendicular upon it. 
Assume [VI]; then OC ='a—W2!, OTC and O7'B are equal triangles, 
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OT' = rand therefore BT" is tangent to the first circle. Condition [VI] 
then is necessary and suftticient. 

Cuse IL. Suppose the last vertex is at (a — Z?, 0). 

The second pair of tangents coincide with the first and the quadrilateral 
is of course a degenerate one. 

In this case the points of contact of the tangents from (a + /?, 0) must 
lie on the first circle, perpendiculars to these tangents at their points of con- 
tact must pass through (« — J?, 0) and also through the centre of the first 
circle. J? must then be equal to « or to — a, 1. e., 


a=, [VI1j 
That [VII] is sutlicient as well as necessary may be shown as follows :— 
Let R=«. From A, (a, 7), drawa tangent to the first circle and drop a 








perpendicular upon it from 2, (a4, — a). Join with O the intersection P of 
these perpendiculars, which of course is a point on the second cirele. 

The angle OP, being measured by one half the quadrant OZ, is an 
angle of 45 degrees. O/? bisects the angle BPT, and O is equidistant from 
ATand BP. Therefore BP is a tangent and condition {VII} is sufficient as 
Well as necessary. 

The complete condition.eayering Case 1 and Case IL is, then, [III] the 
condition given by Cayley. 

The following construction by which an inscribed circle corresponding to 
any given circumscribed circle may be found is interesting. 
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Join any point 7 on the circumference of the given circle (Fig. 4) with 
the extremities .1 and 2 of a horizontal diameter and with the extremities C’ 
and D of a vertical diameter. Taking as a centre any one of the four points 
VY’, O", C, D, where the second pair of lines, which are easily seen to bisect 
the angles between the first pair, intersect the horizontal and vertical diame- 
ters, with its distance from either of the first pair as a radius, describe a circle 
and it will be the circle required. 











Fic. 4. 


The first pair of lines together with their reflections in the horizontal 
diameter will form a quadrilateral A/?BP’ inscribed in the given circle and 
circumscribed about the two circles whose centres O' and 0” are in the 
horizontal diameter. 

The first pair of lines together with their reflections in the vertical 
diameter will form a quadrilateral APBP" inscribed in the given circle and 
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circumscribed about the two circles whose centres C and J) are on the 
vertical diameter. 

The first pair of cireles, one of which is wholly within and the other 
wholly without the given circle, come under Case I and satisfy condition 
rVIj. 

The second pair of circles, each of which lies partly within and partly 
without the given circle, come, under Case (II) and satisfy condition [VIT). 

Every circle that can be combined with the given cirele to form an in- 
and circumscribed pair can be obtained by the construction just given, followed 
if necessary by a rotation through a suitable angle about the centre of the 


given circle. 


HARVARD USIVERSITY, 
CAMBRIDGE, Mass. 











APPLICATIONS OF PROBABILITY TO MECHANICS 
By Epwix Bipwe_it WILson 


Introduction. Tue problem of the application of the theory of prob- 
ability to the motion of a dynamical system and in particular to finding, under 
definitely specified assumptions, the average values of the important quantities 
connected with the motion lies quite within the domain of investigations in pure 
mathematics. Ina broad way the theory that thus arises may be termed séat/s- 
tical mechanics; from a narrower point of view, however, statistical mechanics 
may be considered as embracing only so much of the theory as may be built 
upon the special assumptions which lead to the Maxwell law for the distribu- 
tion of velocities and to the theory of the equipartition of kinetic energy. 
Hence it is that the advances in statistical mechanics have been made almost 
exclusively by mathematical physicists at first interested in the kinetic theory 
of gases and subsequently concerned with finding a mechanical foundation for 
thermodynamics, a mechanical formulation of entropy and the law of degrada- 
tion or dissipation of energy.” 

The treatment of these physical analogies requires the systems which are 
taken into consideration to have a very great number of degrees of freedom 
and consequently introduces into the subject of statistical mechanics, in its 
broadest sense, a complexity which is by no means an essential part of it and 
which goes a long way toward making dificult the acquisition of the elements 
of the theory. ‘To set forth, as simply as may be, the essential and clementary 
steps in the application of the theory of probabilities to dynamical systems is 
the object of these pages ; and for this reason the number of degrees of freedom 
of the systems considered will at first be kept as small as possible. The 
amount of mathematics which is required is small and is amply covered by the 
discussions of the application of definite integrals to determining average 
values of quantities continuously distributed, which may be found in the usual 


texts on integral calculus. t 
* Especial mention may be made of Maxwell, Boltzmann, Planck, Gibbs, Zermelo, 0. Rey- 
nolds, and Jeans. 
+See Byerly’s Jntegral Calenlus, chap. 15; Todhunter’s Integral Calculus, chap, 14; 
Williamson's Jntegral Calculus, chap. 12. 
129) 
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As it is necessary to have clearly in mind what the term probability 
means when applied to continuously distributed magnitude, it may be well 
to recall the detinitions with the aid of an example. Consider what meaning 
shall be attributed to the expression “the average ordinate in a semicircle” 
(which may be assumed to lie in the first and second quadrants). To find the 
average of a finite set of quantities 7, 72, +--+. 7, Which are repeated respec- 
tively 1, 2, +++, m, times (the w’s are called the weights of the quantities), 
one writes 
Qi 1 + Jot +> °° T Tal n ae we wn 


—— ——_—_——__—_——— = 4\ : Th=<s = In ————e 
. i . . N . = Nv . Ss . 
TOs Le ie i . -+ Uy, al; all; all; 





~y 


where 4 denotes the average. The second form of 4 is interpretable in terms 
of probability. For there are altogether the number 2; of the 7’s and there 
are w; of the uantities 7;: and hence the probability or chance that a 7 picked 
out at random is q; is the ratio w, S;. Thus 

Qq =U + eP2 t+ * > + nPn 
is also the value of the average, where p,; denotes the chance or probability of 
an arbitrary 7 being 4;. 

Now take up the question of the ordinates in the semicircle. There are 
an infinite number of the ordinates and the probability that an ordinate taken 
at random be a specified ordinate is zero. ‘The definition of average as given 
for a tinite number of quantities is illusory. To obtain an average it becomes 
necessary to make some specific assumption as to the distribution of the ordi- 
nates. If in the first place we assume that the totality of the ordinates may 
be represented by a finite number x of them which are uniformly distributed 
along the diameter so that there is one ordinate y in each equal interval dx of 
of the diameter or that the number of ordinates is proportional to the length 
of the portion upon which they stand, then the weight to be attached to the 
ordinate or ordinates between and x + dz will naturally be taken to be dz 
or any multiple Adz of dz and the average will be 


ye, 4. pl ry hese ye, YL ydz 
~ day + dz, ‘ ; a dy, ~ Sdx 
In this manner the infinite number of ordinates is really replaced by a finite 


but very large number which are spaced or distributed in a definite manner. 
To obtain a definite and unique result the number of ordinates may be imag- 
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ined to increase without limit and the individual intervals dx to decrease 
without limit so that the average of the ordinates is 


dx 


a 


a, 


+a 
Lydx _ [ _ -{* dx 


where a is the radius of the semicircle. 

It would be possible to make an assumption of distribution which differed 
from the foregoing. We might for instance imagine that the ordinates were 
suspended at equal intervals from the circular arc. In this case it is clear that 
there must be relatively fewer ordinates over a portion of the diameter near 
the center and relatively more over an equal portion near the extremities. In 
fact the weights which are now to be attached are proportional to ds, the ele- 
ment of arc on the circumference. Under this assumption the average of the 
ordinates is 


“+a 
" yds , 
y= lim 9% = al - ao ds _2 a 
’ Yds [oa = Vana = 
as 
rs 


Thus it is seen how vitally the value of the average of a quantity distributed 
continuously depends on the assumption which is made relative to the distri- 
bution. 

Another point to mention is that the probability that an ordinate lies 
between x and x + dx is the weight divided by the sum (the integral) of the 
weights, that is, it is 


de: de ds ds 
Pde = — = > Od Md = = 
™ zu Te Ta 
da . / ds 
J=~ a J~@ 


in the two cases discussed above. It is of course an infinitesimal with dv. 
The quotient of this infinitesimal probability by the differential dx is called 
the probability function or the frequency: thus under the two above assump- 
tions we have respectively 
l 1 ds l 
P= ' and P=— —= 


2a ~ made ava? — 2 


a ees re . ae 
ee ee Oe 


=. ~ 
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The only difference between the probability /’dx and the weight is that the 
weight may contain an arbitrary factor of proportionality whereas the proba- 
bility is limited by the relation that the sum of all the probabilities represents 
a certainty, that is, is equal to unity. In this way the value of the factor of 
proportionality is tixed. 


1. Time-averages in a dynamical system with one degree of 
freedom. Consider the motion of the simple pendulum, or better, the motion 
of a cycloidal pendulum which may execute vibrations of any magnitude and yet 
be governed by the equations of simple harmonic motion. Let the mass of 
the oscillating particle * be taken as unity. Then 

ar de 


=—nZ, v= = 
dt? wis dt 


=n ya? — x, v= asin(nt+ 7). 


What is the probability that the vibrating particle be found between the 
values ¢ and 2 + dx of its coordinate? The time that the particle spends in the 
interval dz is obtained from its velocity as 


da ng 
dt = = d 


v Wa ry 
Hence if the natural assumption be made that the probability of the particle 
being in the interval dz is proportional to the time spent in traversing 
that interval, the probability will be 


ede ha fel. 
ae | — 


nya? — x Ky a— 4° 





as an additional condition due to the fact that the particle must surely lie 
between the limits —a and +a which give the amplitude of the vibration. 
The determination of 4 gives the value 

] u 
[te du “s 7 
[. nya? — s* 


k= 








* We shall speak of a particle merely for the convenience of having a definite con- 
crete conception before us. It should be distinctly noted, however, that the discussion depends 
in no way upon this convention, but holds equally well for any system whose motion js ade- 
quately represented by the equation z = — n?x. 
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This determination of the probability fixes the value of the average of 
quantities connected with the motion. For if u be any quantity of which the 
average is desired, it is merely necessary to form the integral of u times the 
probability over the whole interval. Thus 


"+S dx 
“x= a — ————— .- 


-a 7 nVa? — x? 


If it be desired to average u over only a part of the interval, the probability 
for that partial interval need not be computed ; it suflices to regard the proba- 
bility already obtained, or any quantity proportional to it, as a weight and 


[a 
et —————— 
: \ a" Ens a . 


" ae arene —dSa<BPS+a. 


write 


Thus the average value of the square of the displacement is 


i =a’; 
T 


1 [" «de 


—« Va _ x 


and the average value of the displacement itself is 


1 ft+¢ adx 
z£=>- a. ae 0, 
TJ-a Yu? — 2" 


as was to be expected from the physically obvious fact that the particle 
vibrates in a manner symmetrical with regard to the origin x = 0, 

Again, if it be desired to find the average value of the displacement 
taken without regard to sign, it is possible, in view of the symmetry, to 
take the average of x over positive values and write 


1 {* adz l 
= 7% a = ad 9 
_ Th Var — x T 2 


dx l - 
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The same procedure would be possible in case of any odd power of x. In 
a similar manner the average velocity of the particle may be found as 


-[ vdx 
th Ve@—e mle, 2 


a 
ea en = -- | dx = —na. 
4 T Wo Tv 





‘Lo find the average of the kinetic energy the whole interval may be con- 
sidered. Then 


ser2=2 | . Va? — 2? dx = $ na. 
‘ wT =e 

Such averages as these which have been obtained are called (ime-averages 

because they are evaluated upon the supposition that the probability or 

weights have been determined by the assumption that the weight is propor- 

tional to the time spent in traversing the interval. 

That the average is obtained on the supposition that the time is the vari- 
able according to which the distribution is uniform may also be seen from the 
formula for the probability. With the time as independent variable the intini- 
tesimal probability would be written as /’/¢. Hence it follows that 

Pdt = : nol - = ; ~ - dt = es dt, 


T ya? — 2 yo? — a? al T 


yim 


and the probability function or frequency appears as the constant n/m and in- 
dicates uniform distribution with respect to ¢. The process of finding the 
average of any quantity uv may be likened to thatof finding the average ordi- 
nate in the semicircle. For if u, which is some quantity connected with the 
motion and hence is a function of ¢, be plotted as the curve u = u(t) between 


the limits ¢= — y » and ¢= 7/n — y/n which correspond to a complete vibra- 
tion of the particle from «= —a to x = +a, then the average ordinate of the 


curve will be precisely the time-average of u, provided the average ordinate is 
found on the assumption of uniform distribution of the ordinates relatively to ¢. 
Frequently, however, it is preferable to follow the vibrating particle in 
space instead of in time and to consider a quantity uv connected with the 
motion as a function “ = u(x) of x rather than of ¢. The infinitesimal proba- 
bility when « is the variable is Pda and 
Faz = ie P= : 


] 
, an “i , 
T Va? — 4? 7 Va? — x? 
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Under this assumption the probability function is not constant but is inversely 
proportional to the velocity. From the work done above in determining 
average values it is clear that the time-averages were in reality obtained with 
as the variable although ¢ is the variable with respect to which distribution is 
uniform. The average of ~ is not the average ordinate in the curve 1 = (7) 
between the limits « = —« and » = +4 when the ordinates are considered as 
distributed uniformly in «; the density of distribution must be inversely pro- 
portional to the velocity which is a function of zx. 

Thus far it has been a single vibrating particle which has been under 
consideration and has been followed whether in time or in space over a com- 
plete swing or part of a swing for the purpose of obtaining the averages of 
quantities connected with its motion. For purposes of generalization it is 
convenient somewhat to change the point of view. Instead of following the 
particle in space (and thus following it implicitly in time) let us suppose that 
at any instant there are an infinite number of particles distributed throughout 
the interval from 7 = —a tox = +4, each one moving exactly as the single 
particle heretofore considered would move if at the instant considered it 
happened to be passing through the position of that particle. This amounts to 
changing from a single particle with a definite phase-angle 7 to a set or ensem- 
dle of particles with different phase-angles but otherwise entirely alike. Further- 
more let the density J) of the distribution of particles through the interval be 
proportional to the probability function so that 


TYVa— 


Now it appears at once that for the average of any quantity uv taken for all the 
particles of the ensemble the weights will be Jdxt and hence that 


* Of course it would, strictly speaking, be impossible to distribute the particles contin- 
nously. This, however, is a difliculty which arises in any treatment of a body as a continuous 
distribution of matter or mass. The density here is a linear density and the number of par- 
ticles or the amount of matter in the interval dz is Ddv as in the ordinary treatment of a chain 
or rod. 

+ For Ddr represents the amount of matter or number of particles in the interval dr. 
The conception is entirely similar to that previousiy mentioned in connection with the fact that 
Pde was the number of ordinates in the interval dy when P was the frequency or density of 
the ordinates, 
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“4a "Loa 
/ uDdr | uPdx he 

w= Ss = = uPde (since D x I) 

| Dax | Pde J-* 


a 





and that in like manner the average taken over a part of the ensemble will be 


- - 
| u dade | u Pdr 


“= = + —-— 


oT ert ‘B 
/ uDder [ Pax 


From the above formulas it ix seen that the average values of u taken 
over the ensemble or any part of it are identical with the corresponding aver- 
aves found by following a single particle. This identity is not fortuitous but 
arises necessarily from the law of density assumed for the distribution of the 
particles in the ensemble. The advantages of passing from time-averages to 
ensemble-averages will appear more clearly in the next section. One point 
should be noted here. The density of the distribution of the particles has 
been assigned at a specified instant. From that instant on, each particle will 
execute a simple harmonic motion and therefore at any subsequent instant the 
density of the distribution will be determined by the original distribution and 
by the motion of the individual particles. It is extremely important, in fact 
it is essential, that the density of distribution shall not change with the time ; 
otherwise if the ensemble-averages were taken at a later time, the averages 
would not in general be identical with the time-averages. 

Suppose that J is the density of distribution of particles in the en- 
semble. Consider any interval /¢ between * = —a and « = +a. The number 


‘ae t+de 
of particles in this interval is | Désx,* and it is desired to express the fact or 


the condition that there is no change in this number owing to the motion of 
the particles, that is, that as many particles are brought into the interval dr 
as are removed from it during each interval of time. For convenience let the 
interval of time 6¢ be chosen as small, small even relatively to the time that 
it takes a particle to move over the interval dr, and let the velocity of the 





* The symbol 4+ is used in place of d+ in the integrand to avoid confusion with the total 
interval dz. 
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particles be positive. In the time é¢ the particles which are in the neighbor- 
hood of the point « have the velocity + and will move through the 
distance 6c = v6. The number of particles in the interval dc = vét just to 
the left of a is bec = PMrét and it is this number which will advance into 
the interval dx in the time 6¢. The velocity of particles in the neighborhood 
of x + de is v + dv and the distance they will move is 8x = (v + dv)é. 
The number of particles in the interval 6” = (7 4+ dvr)ét just to the left of 
e+dxis (D+dD) 8e =(D+ dD) (v + dv)ét and it is this number which 
will advance out of the interval (x in the time 6. Hence if the number in the 
interval is to remain constant, it is necessary and sufficient that 


(D+ dD)(v + dvyét — Drét = 0. 


The condition thus obtained may be simplified by discarding &, by 
expanding and canceling Jr, and by omitting the infinitesimal 7Ddy of the 
second order. The result is the differential equation 


rd D) + Ddv = 0, of which Jr = constant 


is the integral. It therefore appears that the necessary and sufficient condi- 
tion that the distribution of the particles remain unchanged owing to their mo- 
tion is that the density be inversely proportional to the velocity. As this was 
the law assumed above, it is clear that that distribution was permanent in time. 
Such a distribution is said to be in statistical equilibrium. The result Dy = 
const. just obtained depends only on the facts that the particle has one degree 
of freedom and that the velocity is a function of the coordinate specifying the 
position of the particle.* | In such mechanical systems the only possible distri- 
bution in statistical equilibrium is that in which the density is inversely 
proportional to the velocity, and averages taken over the ensemble thus distri- 
buted are identical with the time-averages found by following a given particle 
throughout its motion. 


2. Ensemble-averages in a dynamical system with one degree 
of freedom. In the foregoing section it has been assumed that a definite 
particle was under consideration and that consequently the amplitude @ of the 
vibration was known. To know the phase-angle was unimportant because the 
particle was followed throughout a vibration. The ensemble which was intro- 
duced was obtained by disregarding the phase-angle and assuming an infinity of 


* It should be noted that the conception of statistical equilibrium as set forth in these 
paragraphs requires the motion to be conservative. 
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particles all with the same amplitude (and consequently with the same 
energy) but distributed over all possible phase-angles. This means that one 
constant of integration, the constant 7, has been regarded as unknown and that 
a distribution relative to it has been introduced to make up for the lack of 
knowledge. Suppose next that all that is known about the particle is that. it 
satisfies the ditferential equation of simple harmonic motion witha given value 
for n. The initial conditions which determine the constants of integration « 
and y are unknown: no particular particle can be followed in its motion. It 
is necessary to replace the particle by an ensemble in which distribution with 
regard to both constants of integration is allowed. This ensemble will be a 
two-dimensional ensemble, not a one-dimensional ensemble as before. 

The initial conditions which determine the motion of any particular 
particle are the position and the velocity at any instant: these may be inter- 
preted as the coordinates of a point in an auxiliary «-r-plane. Thus at the 
given initial instant there will correspond to each possible motion regulated 
by the given differential equation a detinite point of the x-v-plane, and con- 
versely to each point of the plane (possibly only within certain restricted 
regions of the plane) there will correspond some particular motion with speci- 
fic amplitude and phase-angle. The point (.7, ”) will be called the represen- 
tatine point of the motion. As time goes on, the representative point for 
any particular motion will change from its initial position and will describe 
a curve in the z-7-plane: for both the position and the velocity of a given 
particle are functions of the time. In the case of simple harmonic motion 
these curves will be the ellipses °z? + 1? = v*a*. Nothing as yet has been 
said as to the density assumed for the distribution of the representative points 
in the z-v-plane at the initial instant.* That requires careful consideration. 

In the first place let it be noted that what the density will ultimately be 
used for is to determine average values over the ensemble according to the 


rule 
/ / ” Ddrdv 
/ | Ddxdr 


* It should be remarked that here we are speaking of the density of representative points 
and are not thinking of the density of the particles as in the previous section. In fact it might 
have been better though less concrete to have spoken in the first instance of representative 
points and of their density of distribution rather than of the particles themselves. 
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where u is the quantity to be averaged, where Ddzdv is the product of the 
density by the element of area and consequently represents the number of 
motions whose representative points lie in that element, and where the double 
integrals may be extended over any region of the z-v-plane which contains the 
points representative of those motions for which the average is desired. I¢ 
the density J of the distribution at the initial instant were assumed at random, 
the motion of the representative points would in the course of time bring 
about a new distribution which in all probability would differ from the origi- 
nal distribution ; the averages at the later instant would then probably differ 
from the original averages. To avoid these inconveniences it is desirable to 
impose upon the density D the restriction that it shall represent a permanent 
state, that is, that the ensemble of representative points shall be so distributed 
as to be in statistical equilibrium.* 

The conditions for statistical equilibrium are not hard to find. The most 
general assumption for the density J) would be that /) is a function D)(2, v, ¢) 
of the position, velocity and time. Now if the distribution is permanent, the 
density cannot depend explicitly on the time, and ¢D/o¢=0. At a given 
time the number of representative points within a given rectangle J? of the 
x-v-plane between x and « + dx and v and v + dev is 


“r£+dr w+dr 
| | D(x, v)dxde. 


The number of points can change only by some of the points moving across 
one of the four sides of the rectangle. As 6 = vét, the points which will 
move into J? across the left hand side of length dv are those situated in the 
small rectanglet of area dedv just to the left of 72; their number is Dv dtdv. 
The points which will move out across the right hand side of length dv at «+ dx 
must be those in the small rectangle of area &'xdv just to the left of that side 
and, as v is here one of the independent variables and is constant along li.es 








* Attention may again be called to the fact that the conception of statistical equilibrium 
requires the motion to be conservative; for if the motion were dying out, it would not have the 
necessary characteristic of permanency in time. 

t The same assumption as before relative to the magnitude of 5¢ will be made, namely 
that 5¢ is small compared with the time it would take one of the points to move across the rec- 
tangle 2. Moreover, owing to the smallness of 5z as compared with dv or dr, the number of 
points which might move out or in across the short sides 5c of the small rectangle may be 
neglected. 
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parallel to the x-axis, 8’x = dx = vit; the number of the points is therefore 

(D + = dz) vétdv. As dv = f8t, where f denotes the acceleration which 
Cx 


is given by the differential equation as a function of 2, the points which will 
move into #2 across the lower side of length dx at v are those in the small rec- 
tangle of area dvdx just below #; their number is D/fétdx. The points 
which will move out across the upper side of length dx at v + dv are those in 
the small rectangle of area d’vdc just below that side and, as f is a function 
of x alone and is constant along lines parallel to the v-axis, &'v = dv = fot; 


. a ; D ; 
the number of the points is therefore (v + “= dw) fetde. Hence, if the 
ae 
total number of points is to remain unchanged, 
cD . - op ; 
Dvitdv — (p+ - dic vétde + Dfédtde —{( D+ = dv) Sdtdx = 0 
Cx ov 
cb oD , 
or ~ “e+ J = . 
CL cv 


This is the condition for statistical equilibrium which was desired. It is there- 
fore seen that // must satisfy a detinite partial differential equation in «2 and v. 
The solution of the partial ditferential equation just found depends upon 
the svlution of the simultaneous set of equations* 
dw di dp 


, Ff 6=(Ci‘k 
The solution of the first two equations may be obtained at once : 


Side = vdv or wifilar =~ mrdr or Fir = medv 





if the mass of the moving particles is # and /’ is the force acting on it. The 
equation may be simplitied by writing /’= — dV/dc, where V is the poten- 
tial energy (the motion is assumed to be conservative). Then 


—-dV=d(3 m2), and finally 4 mv? + VW = const. 





* It may be noted that if these equations be written in parametric form as 
dr dv dt 


v Fd | 


they define the path of the representative point in the s-¢-plane, 
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Hence the value of D is D = Q (const.) = 2 (#), where £ is the total energy 
and (2 is any arbitrary function. It therefore is seen that the condition for 
statistical equilibrium in the distribution of representative points for possible 
motions of a conservative dynamical system of one degree of freedom is that 
the assumed density should be a function of the total energy. 

The ensemble-averages will naturally depend upon the choice of the ar- 
bitrary function 2. Suppose that (in the special example under considera- 
tion where f= — 127 and m = 1) this is taken as 


p= ()( IL) = he- CF = kre 2 het + bntct : 


where / and ¢ are constants; and let it be assumed that any values are per- 
missible for the velocity and position of the particle so that the representative 
points are spread all over the -v-plane. The mean square of the displacement 


gy? “ x v= 


x 
‘$e (+e 
| / e- < het + ntact dad 


x will then be 


The form of the integrals is such that the integrations with respect to 2 and v 
are quite independent. Hence, discarding the integration with respect to v 
which introduces merely a factor in numerator and denominator, the result 
may be written as 


+ -£ e ] 
207 2 Pd yp oe 27 
/ xe da 3,3 \=7 1* 

2 es es = - = ” = _ = J » . 
= “+ x Pe i cn” 
— nie ! 

es dx \ 27 
- cn 


The average value of the potential energy would be 1/2c?. It may readily be 
shown that the average of the kinetic energy would also be 1 2c?. The average 
displacement would of course be zero. To tind the average of the magnitude 
'2| of the displacement it would merely be necessary to evaluate 


*By the ordinary methods of evaluating these integrals or by B. O. Pierce's Short Table of 
Integrals, formulas 492 and 494 of the revised edition. 
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Other averages with this assumption for the density of the distribution and for 
the extent of the distribution of the representative points in the plane may be 
obtained if desired. 

To connect the present discussion where the energy is allowed to have 
different values with the preceding case in which the energy was fixed, it is 
merely necessary to consider those motions whose representative points lie be- 
tween the curves / = Cand FE = (' + d¢ of equal energy. In the above ex- 
ample these are similar and similarly situated ellipses. If zand 4’ instead of 
xand v are taken as the independent variables, the density of distribution 
takes the form 
da 


, 
. 


Dded b= Q(E) tal dalE = 2(4) dE 

ck m ’ 

where m is the mass of the particle; and hence it appears that if the energy 

be held constant the density regarded as a function of « is inversely propor- 

tional to the velocity. This is in accord with the results of the last section, 

as was to have been expected. For different energies the density is no 

longer proportional to the velocity because the factor Q (£#)) is ditferent for 
ditferent values of F. 

Once the density is fixed as 2 (£), the probability function which is to 
be taken proportional to the density becomes /?=/ 0( EF). If the limits of 
the region throughout which the representative points are assumed to be spread 
are also assigned, the value of & may be determined by the relation 


| [42 Eydete = or f [ECO eae =, 


mn t 


where the double integral is extended over the entire region occupied by the 
representative points: for the total probability is 1. It is thus seen that where- 
as in the simplest case where there is a constant energy it is most natural to 
start with the conception of the probability and the time-average and then to 
proceed to the ensemble and to its density of distribution, in the case of vari- 
able energy it is simpler and almost necessary to start with the ensemble in 
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statistical equilibrium and proceed to the probability. And whereas the time- 
averages and ensemble-averages were identical in the first case, they are dis- 
tinct in the second—or better, one might say that the conception of a time- 
average implied the following of a definite motion or at most a group of motions 
which differed only by an additive constant for the time and that therefore the 
conception became illusory in the second case and we were forced to use en- 
semble-averages. 


3. Systems with two degrees of freedom. Further to investigate 
the advantage or necessity of having recourse to the ensemble in statistical 
equilibrium consider what happens when averages are desired for a particle or 
system with two coordinates. To exhibit some of the points at issue it will be 
sufficient to consider the simple case of harmonic motion relative to two per- 
pendicular axes and defined by the equations 


Pr 


My i dr d y 
de 


2 yom —— 
geen Fees Tt Vey, 


— 2, 
r=acos(nt+y), y=bsin(mt+ 8), T= $(0?a? + wil? — n2x? — my’). 


As the mass of the particle has been taken as unity, we have 7 = \2 7' and the 
time required for the description of the are ds is 
ds = y dx? + dip? 


dt pe a nm SS ~ 
Vnta? + nl? — n2x? — m2? 








If as in the tirst section the probability be considered as proportional to this 


time, there results 
Ads ; keds 


Pat = — : — _ em ry so —= ee = = => 1 . 
a 4 * 4 4] 4 ” ”» s) . ” 4 a 
Vn2a? + mb? — n2? — ni? J ¥nta? + nh? — nPx? — Py? 





where the integral extends over the entire length of the curve described 
by the particles. 

Now if the constants n and m are commensurable, the particle will 
describe a closed path, the integral which must be evaluated to determine * 
will converge, and the value of / may be found. Then the time-averages of 
any quantities connected with the motion could be found as in the first 
section. But if and n are incommensurable, the particle will describe an 
endless Lissajous’ curve, the integral will diverge, the determination of & will 
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be impossible, and the whole problem will become illusory. It appears, 
however, that the particle after a sutlicient lapse of time may be found as 
near as desired to its original position and with its velocities 7, and 7, as near 
as desired to their original values. By disregarding these small differences the 
path may artificially be considered as closed, a value of / may be computed 
on this assumption, and the averages may be found. This method of dealing 
with the problem is possible and might readily be adopted from a mathemat- 
ical point of view and would undoubtedly give satisfactory physical results. 
But the method which actually is adopted is to pass over to the consideration 
of ensembles distributed in- statistical equilibrium. This will) now be 
explained. 

Four initial conditions are required to specify the motion of the particle, 
namely, the initial values of «, y. r,, ",. Let these four quantities be inter- 
preted as rectangular coordinates in a space of four dimensions, thus obtaining 
in that space a representative point for the motion. Each point of the space 
will correspond to some motion* and every motion will have a definite repre- 
sentative point at the initial instant. As time passes, the representative points 
will describe paths of one dimension, that is, curves in the representative space 
of four dimensions. Everything is quite similar to the simpler case where 
there was only one coordinate and the representative space had two dimensions. 
ITere the most general assumption for the density would be that it was a 
function D (x, 7, v,, ",, ¢) of the tive variables which enter into the desecrip- 
tion of the motion. If there is to be statistical equilibrium, that is, if the 
density is not to depend on the time, considerations like those previously 
advanced show that J cannot depend explicitly upon ¢ and must satisfy the 
partial differential equation 


eD cD chp). cD 
s VU; 7 eo vy + a ,. = a — 0), 
Cr. cy ‘ ov,° cr ¢ . 


The details of the deduction may be omitted. 
The solution of this partial differential equation depends on the solution 
of the system of ordinary equations 
dy dy dv, dr, dap 
Te Y, vie , i ae a 








* There may be need of restricting the range of representative points to some region of 
the four-dimensional space; for the energy must be real and moreover it may be desirable to 
restrict our discussion to only some of all the possible motions. 
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There will be three independent integrals of these equations and the density 
D will be a function, an arbitrary function, of the three. In the particular 


case considered above the equations become ? 
dr dy adr, - dv, _ dD 
vy vy wer miry Q’ 


and the integrals are readily seen to be 


n?x? + ¢2 = 2c,, nery? + vs = 2¢,, 
1 ux 1 my 
; cos~ ! SoST - = cos~! —.. An = C23 
ue \ nx? + ve dit \ my? + Uy 


and hence if the distribution is to be in statistical equilibrium it is necessary 
and sufficient that the density be 


D = 2(e), Cg, ¢3)- 


It will be noticed that the sum of ¢, and ¢, is the total energy Z and the 
density may therefore also be taken as 


D= Q(B, ey — ex, ¢5)- 


Averages over the ensemble are obtained by quadruple integration 


/ | / / uDdedydv ar, 


oe 


If] / Didxudydv,dr, 


extended over such four-dimensional regions as contain the representative 
points of the motions for which the average is desired. 

It should be noted that the three integrals ¢,, c,, ¢;, are invariants for 
the motion of any given particle, that is, are constant in time when any partic- 
ular motion is followed. For they satisty the above set of simultaneous 
ordinary equations and hen:e also the partial differential equation which D 


satisfies. That is, 


ce ery OC, » , OY . 
a ?. T v, + | ay Je 7. = _ Fe = 0, 
Ox Cy" Ov, Ov» 


and so for the other two. But as ¢,, and also the others, do not contain the 
time explicitly, it follows that dc,/¢¢ = 0 and hence 
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de, Cry Ce, Ce) Cry ‘ Cry . P 
- mae + aU, + x vy + — Je + -— Jy = 0, 
dt ot Cx cy cv, ov, 


which shows that the total derivative of c, with respect to the time is 0. 
It is likewise evident that any function of these three invariants is an inva- 
riant of the motion. Moreover there could not be a fourth invariant which 
was functionally independent of these three. For if there were four invariants 
Cys Cay Czy Cy Which were functionally independent, they could be solved for 
the four arguments x, y, ",, v,; they would therefore entirely fix the repre- 
sentative point, and there would be no possibility of passing from one such 
point to another in following the motion. It is clear that apart from the 
particular case of harmonic motion there will be for the motion of any system 
with two degrees of freedom three and only three functionally independent 
invariants. If there is statistical equilibrium, the density must be a func- 
tion of these three quantities and as the energy is surely an invariant of the 
motion, it may be taken as one of the three. 

An interesting connection with the theory of time-averages may now be 
made. For, suppose that the density is chosen as a function of the energy 
alone, as )) = 2(£), and let the energy be taken as one of the independent 
variables replacing v,. Then the density may be written in the form (if it 
be assumed that the mass is unity) 


ly I; 
Dixdydy dE =O(L) “s dydv,d B= Q(E)dydrd E wot id 
. - . . . hi 


Now consider all the motions whose representative points lie between the two 
surfaces (of three dimensions) / = Cand F' = ('+ dC’ in the representative 
space of four dimensions. For these motions 2 (#) has the sensibly con- 
stant value 9 (C’) and hence it appears that the density is inversely propor- 
tional to the velocity with a constant factor of proportionality 2 (C’). As 
the probability is taken as directly proportional to the density, it is seen that 
the probability is proportional to the time of describing the are ds. Hence if 
averages be obtained by integration over the region between the surfaces E = 
Cand FE = (+d, those averages may be considered as time-averages, 
and are generally so considered although it is clear that motions whose 
representative points lie between these surfaces are by no means so restric- 


Cv, ] . " - 
*F ie . 2 es 2 2 
For am “= and dr: yds? +dy/ =v: 4272 40,7. 
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ted as to differ merely by an additive constant for the time. Thus it appears 
that the introduction of statistical equilibrium and ensemble-averages has led 
naturally to a very considerable entension in the idea of a time-average, an 
extension which at first sight might have seemed to be contradictory to any 
notion of time-average. 


Conclusion. The foregoing discussion has been presented as an intro- 
duction to the general theory of statistical mechanics. It is not intended to 
present that theory here. Readers may refer to such portions of Gibbs's trea- 
tise on the subject* as may interest them. It may, however, be stated that 
the subject is concerned not so much with the physical system which is exe- 
cuting the motion as with the differential equations of the motion. The word 
particle has been employed so largely merely for definiteness. The work of 
the text which was connected with the equation d?x d@ = — n’x would apply 
equally well to the vibrations of a torsion pendulum where x represented angu- 
lar rotation and + represented angular velocity. The problem of article 3 
could be interpreted as being that of the vibrations of any system depending 
on two generalised normal coordinates. Under this class of problems would 
come the case of a stretched massless string carrying two particles constrained 
to move in a plane and in lines perpendicular to the position of equilibrium of 
the string. t 

In considering the motion of any conservative system of which the position 
is specified by nx generalised coordinates, it is customary to introduce the gen- 
eralised momenta in place of the generalised velocities. If, 92, - + +, 7, are 
the generalised coordinates, 7’ the kinetic energy, and V' the potential energy, 
the generalised momenta are 


; ™ oT ' ” oT _ oT 
= on,’ P2= 3," » Pa = 55. 
by definition. The equations of motion are then 
_¢e(T-V) . o(7T-V) 
— —_ . — Fn 


* Elementary Principles in Statistical Mechanics by J. Willard Gibbs, 1901. 

t This problem is frequently treated as a preparation for considering the string as loaded 
with n particles where n is allowed to become infinite and thus lead to the vibrations of a violin 
string according to the method used by Lagrange. 











148 WILSON 


in the Lagrangian form, or in the Hamiltonian form they are 

e(T+V)  . &(T4+V) 

= - q,= a 

] t ey; ’ 7i Cp, ’ 
There are 2 initial conditions corresponding to the initial values of the 2n 
coordinates 4 and momenta p. These 2n quantities may be regarded as the 
coordinates of a representative point in 2” dimensions. The condition for 
statistical equilibrium is then that the density D of the distribution of these 
representative points over the auxiliary space of 2x dimensions shall satisfy 
the partial ditferential equation 


=1,2,---., vw, 


cD . ob OD. ‘ oD. . 
— ¢ ——Tipt c+ * = Vn = a= ° 
on " ép,! ’ Fn , ép,? 


The solution of this equation for J) depends on the solution of the set of 
simultaneous equations 
dq dp, dq, dp, dD 
N "} Pi 7 nm dn Pn ° 
The total energy = 7+ J is one integral of these equations and there are 
2n — 2 other integrals (C,, C,,- +--+, Cy, 2... The density D is therefore any 
arbitrary function 2 of these quantities, 


D= NE, Ci Ces 12 Cen aes 


It is generally customary to consider the density as a function of the energy 
alone. The particular function which Gibbs uses and which, under the as- 
sumptions which he makes, establishes the principle of the equipartition of 
kinetic energy, is 
F-—b 
DxzP=+ ° , where @ and ¥ are constants, 


and the distribution is then called canonical. In the particular case where 
the attention is concentrated on the motions whose representative points lie 
between two adjacent surfaces = Cand EF = C + dt’ of constant energy, 
the distribution is called microcanonical. The advantage of discussing the gen- 
eral canonical case first and the microcanonical case second has been sutlicient- 
ly illustrated by the treatment of the problems in the text. 


MASSACHUSETTS INSTITUTE OF TECHNOLOGY, 
JANUARY, 1909. 











THERMODYNAMIC ANALOGIES FOR A SIMPLE 
DYNAMICAL SYSTEM 


By Epwin BipwELL WILson 


Introduction. Iw a recent article * I have set forth some of the most 
elementary notions in regard to the statistical mechanics of dynamical systems. 
The object of the present communication, which may be regarded as a sequel 
to the other, is to investigate a simple dynamical system for the purpose of 
exhibiting certain analogies between its statistical properties and the elemen- 
tary thermodynamics of gases. The dynamical system which I shall choose is 
the somewhat familiar household article known as the Japanese or Chinese 
dinner-gong and consisting of a series of gongs of varying sizes suspended 
one above another upon a vertical string. The number of degrees of freedom 
of a set of several gongs is considerable ; and to simplify the work it will be 
assumed that the only motion which the gongs may have is oneof rotation 
about the vertical string as axis. Under this hypothesis the number of degrees 
of freedom will be equal to the number of gongs in the set. 

To find the equations of motion of the system in the Lagrangian or 
I[amiltonian form, it is first necessary to write down the kinetic and potential 
energies of the system. If 4, J, Z;,--- are the moments of inertia of the 
successive gongs beginning with the top one and if 4, G2, 93, --- are the 
angles through which the respective gongs have turned from the position of 
equilibrium in which there is no twist in any of the strings, the angular veloc- 
ities of the gongs will be respectively 7, g2, Y3, +++ and the kinetic energy 
of the system will be 

T=3(hGit hii+ hit ---)- 
The momenta corresponding to the different coordinates are 


oT ' . ipi 
= iq, = 1q; or q;= 1 : 





* Applications of Probability to Mechanics, ANNALS OF MATHEMATICS, volume 10, pp. 
129-148 (1909). References to this article will be given simply by the page numbers in pa- 


rentheses without the quotation of the title. 
(149) 
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and hence as a function of the momenta the kinetic energy is 


i % Ps 
T=q=3 (P+ + = te. -)." 
. LK & &s 
For the purpose of getting rid of the constant 7 in some of the subsequent 
averages, the assumption will be made that the product 
1 


~ Qa’ 


5 
and the kinetic energy will be taken in the form 
T= 6, = 7(Q Pi t+ 2pPpt+ Upyte++), Uads---=1. (1) 


The potential energy of the system will consist of the energies stored in 
the twisted strings. The twists in the successive strings beginning with the 
top one and running down are 

i» Is — Tis Ys — Yn °° * 

The potential energy will be proportional to the square of the twist, where 
the factor of proportionality may depend on a variety of things such as the 
kind of string, the diameter of the string, the length of the string, and the 
tension in the string due to the weight of the gongs. In order that there may 
be some possible variation of the system as a whole it will be assumed that the 
length of the uppermost string is adjustable ; this could easily be arranged by 
running the string between two closely pressed wheels, which could be moved 
up and down along the string, instead of merely pinching it in a vice. It is 
then important to know at least how the factor of proportionality varies with 
the length of a uniform string when nothing but the length changes. Now 
from the symmetry of the figure it appears that to twist a string through an 
angle q is equivalent to twisting each half of it through the angle 4 ¢ in oppo- 
site directions. Hence if the factor of proportionality is supposed to vary 
with the nth power of the length and is written as //", the equation of the 
whole potential energy to the energy of the two halves gives 


]\n q 2 1\” 
j:/n a. ei — i = 
my (3) (3) i 


* The use of ¢, for kinetic energy, e? for potential energy, ande = T + V =e + «¢, for total 
energy is adopted to conform to the usages of Gibbs's Elementary I’rinciples in Statistical 








Mechanics. 
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and it is seen thatn =—1. The potential energy of the system will there- 
fore be written in the form 
Vis eg = Aly'Gi + ele'(2 — N)? + Als (Ge — GM)? +--+ 


For the sake of simplification let it be assumed that an adjustment between 
the sort of string and the diameter of the strings and the units of measure- 
ment has been made so that 


k,ly' =T, k,l * =m,---; and let k,l; = wu, 


so that the parameter v is proportional to the fourth root of the length of the 
uppermost string of variable length. Then 


Vse=a[v'git (92-1)? + (3-2)? +--°] 


is the expression for the potential energy of the system. 
The differential equations for the motion would be 


At+e7*4+)a—@=9, prt 2e—N—-G=%--> 
or 


Pit(e~£+1)a—-—2=9, W=UPy Pat 242-N—-G=9% G=%Pn--- 


by direct substitution in the formulas of the Lagrangian or Hamiltonian equa- 
tions (see page 147), where 


T— V=e,—¢€, = 7[a pi + 42 ps + Gsp3 +--+ — vt gi — (92 —- )* 
hey ages) 
or 


T+ Vsze4+¢e,=7la pit appt apst---+ter'at (a- MN)’: 
+(q9; — 92)? + + >>]. 


If D denotes differentiation with respect to the time, the Hamiltonian equa- 
tions after the elimination of the momenta are 


1 
E DP + (u-*# + 1) a — 0% = 0, 
1 


. 
—% +[-. LP + 21 — 4s = 9%, 
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Hence, if for example the number of gongs is three, the solution of the dif- 
ferential equations* depends on the solution of 


1 44 —1 0 
a, 
1 
wi —~P+2 1} =0, 
(ly 
© ie 
0 — 1 —PFPr+1 
a3 








which is an algebraic equation of the third degree in J*. If there had been 
five gongs the solution would have depended on the solution of an algebraic 
equation of the fifth degree in )*. As these equations are necessarily literal 
when the length of the uppermost string is adjustable and v cannot be assigned 
a numerical value, the solution of the algebraic equations is impossible in 
practice. This will make no difference to us inasmuch as the discussion of 
statistical mechanics is based on the differential equations and not upon their 
integrals. 


1. The canonical distribution. Suppose for the moment that the 
system consists of three gongs. The state of the system is at any instant 
completely determined by the values of the six quantities 9), G2) 93, Pis Pa» Ps- 
The representative point of the motion would therefore be located in a space 
of six dimensions. Consider an ensemble of these dinner-gong systems dis- 
tributed with the density or better with the probability (see page 148) 


u-e 


P=e"*, e=T+V=6,+€,, wand © constant, 


where 
e=m [pi + 2p; + Gp; + “Gi + (42 — N)? + (4s — 92)")- 


The distribution will be in statistical equilibrium because the probability, or 
density which is proportional to it, is a function of the energy alone and the 
energy is an invariant of the motion. Let it further be assumed that the dis~ 








* See Forsyth’s Differential Equations, 3rd edition, p. 305. Or Cohen's Differential 
Equations, p. 150. Or any treatment of simultaneous linear differential equations with constant 
coefficients. 
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tribution covers the whole of the six-dimensional representative space so that 


the relation 
: wre 
5) 
SLL LL © dp, dp,dp;dq,dq,dq3 = 1 (2) 


must be satisfied (see pages 132, 142). This distribution will be considered 

as the canonical distribution and averages over it will be investigated.* 
The equation (2) may be used to determine the constant y in terms of 

the other constants of the problem. For as W and © are constants, the 


uv 


factor e* may be passed out from under the sign of integration and may be 
transferred to the other side of the equation. Then 


+ @ 
e? -| LLL [ [ee drdpetpran ann = 
to _ =p, +2 _ =p; ee owe 
[ e “9 ay [ e “dp, | e “9 dp, x 


+x = =q,* += _ (9 = q,)? be x AF q:)* 
e “diy e - a | e eo dq. 
J=— & ee “J-« 


Ilere the six successive integrations are to be carried out by beginning with 
the one for 7; and working backward through the list. As g, is regarded as 


* As ageneral comment on the assumptions underlying the foregoing use of a canonical 
distribution, one point in particular should be mentioned. It has been assumed that the repre- 
sentative points for the motion of the systems in the ensemble are spread over the whole of the 
representative space, that is, that the coordinates ¢,,¢7,, 7,, ..- and momenta p,, p,, P,,- ..may 
have any value. This necessitates the admission that the angular velocities of thediscs and the 
twists in the strings may be indefinitely great. It goes without saying that this is physically 
impossible: for if the strings were allowed to twist very much, the law of torsion would de- 
part from the simple linear (Hooke’s) law and the expression of the potential energy would 
change; furthermore the twisting would perhaps materially shorten the strings and introduce 
complications on that score. It should be noted, however, that as the coordinates and momenta 
increase, the energy increases with their square, and the probability assumed for the distribu- 
tion decreases exponentially with the energy. Hence the probabilty of systems with large ve- 
locities and twists is almost negligibly smallcompared with that of systems with small velocities 
and twists. The errors made by entirely neglecting the systems in which the velocities or twists 
were so large as to introduce physical difficulties would therefore probably be small and could 
certainly be made small by restricting @ to sufficiently small values. lence there is no need 
of worrying very much over such difficulties. 
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constant for the integration with respect to g;, we may write 


ee ae ‘+2 _ x(q;— 9)" V8 -~ve 
| ¢ s a= | e - htt Vr = yo. 
—_—s T 


—@® 


The result is independent of g,; in fact the integrals may be regarded as 
multiplied instead of iterated. The result is 


e 


te¢ 


= Vaart, v°(VB)* = Or, 
the last reduction being due to the assumption (1). Hence 

y=— 20 logrv —30 logo. (3) 
It is readily seen that if there had been n gongs, the expression for y would 


have been 
vy=— 20 logv—n@ log ©. (3') 
Another important quantity is the index » of probability which is detined 


as the logarithm of the probability. Thus 


y—e 
n= log P= On P= e. 


The average value of the index of probability » taken over the ensemble is 
found by evaluating the integral 


n =| ° , ° | nerdpy cee dy. 


This may be evaluated directly ; but the work may be very much shortened 
by adopting the device of differentiating (2) with respect to @ regarded as 
a parameter. Upon the left-hand side, the differentiation may by Leibniz’s 
rule be carried under the sign of integration. The result is 


+2 £9 Y= +? (fF ¢—<« 13895 t= 
8 ” 
[ [i dp, es dq. =|: | |- 7 6 5 | e dp, i di; =. 


Now © may be canceled out and the result transposed so that 


oy pte pecs +e yo, oo! 
56. i fe 6 dp, cee dq =| . IS "> dp, c 8 dqs 


x 


ey +e 
id so ' | + | nen -+ + dys = 9. 
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That the factor 0y/d @ may be transferred to outside the integral sign 
depends on the fact that y in (3) does not contain the variables with regard to 
which the integration is performed. 
By differentiating y with respect to © the value of 7 is 
7 = — 2 logv — 3 log 8 — 3. (4) 


And now the average value ¢€ of the energy e may be found very simply. For 
as y and @ are constants, it follows from 


= © : 
amen that = 
Hence e=Y¥—697n = 30. (5) 
Moreover if there were n gongs, the results would be 


)=—2logv—n log O—n, (4') 
YY —-97=790. (5') 


€ 


The parameter v which enters into the expression for the potential energy 
and which determines the length of the uppermost string may be regarded as 
an external coordinate of the system. The difference between an external 
and an internal coordinate is simply that the internal coordinates are those 
which take an active part in the motion, and in the solution are to be ex- 
pressed in terms of the time ; whereas the external coordinates fix the system 
as a whole and take no active part in the motion, although different motions 
will arise from varying such coordinates. Now the forces are derivable from 
the potential energy and they will be of two sorts, internal and external. If 
Vi, Gar Ys, - ++ are the internal forces and A is the external force,* then 


Ce, Ce, Ce, 
V; —_—_-- qu 9 (Vs —_—_—_ = U ’ 9 A — = at 
on : TE: ov 





* The external force A should not be confused with the force which is required to support 
the system of gongs from falling under the influence of gravity. The force of gravity does 
not enter essentially into our problem. In fact it might be convenient to imagine that the force 
of gravity were continuously diminished toward zero as a limit as if the system were removed 
to a point indefinitely near to center of the earth. It may be noted, however, that the force of 
gravity does no work on the system even when the length of the uppermost string is changed : 
for the assumption has been made that the wheels which pinch the string are moved up and 
down to regulate the length rather than that the string should be paid out between them 
in such a way as to raise or lower the system. 
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It is easy to see without recourse to formulas that if the uppermost string is 
lengthened, the system loses energy: for a given angular twist upon a short 
string represents a greater amount of energy stored in the string than does the 
same twist in a longer string. 

The average value of the external force «1 over the ensemble may readily 
be computed. It is merely 


J += Ce » het 
\Asf..-f — %e © dp,--- das. 
f / cv ” . 


To evaluate this the device of differentiating (2) will again be used, but in 
this case the differentiation will be with respect to v. Note that y contains v but 
© does not, and that as ¢, does not contain v, the derivatives of « and ¢, with 
respect to v are equal. Then 


$e did 1/cy hs T2 
[-- TE ov!” ae “ dp, ces diy; = 0, 


A= ae a ee +s fend acu aes, 
[- / = 7/3 lig CPi 43 


Hence , a: haa. (#) 
cv v 


The result is independent of the number of gongs. 


2. A thermodynamic analogy. Let p, V, 7, U’, S be respectively 
the pressure, volume, temperature, energy, and entropy of a gas. Let the gas 
be assumed to be in so far perfect or ideal as to satisfy Boyle’s characteristic 
equation p= /?7 without sensible error. Further let the units of measure- 
ment be those ordinarily adopted in theoretical and practical physical chemis- 
try.* Then pV = 2T is the form of the characteristic equation. Moreover the 
energy of a perfect gas varies directly with the absolute temperature and is 
(’=c,T where c, denotes the specific heat at constant volume. In_ the 


* At the absolute temperature 7 = 273° centigrade of melting ice and under the normal 
atmospheric pressure of 76 cm. of mercury or 1033 gr. per sq. cm., the gram-molecular volume 
is 22,380 cu.cm. Hence 

1033 X 223%0 


(= =— 9 ‘ ; - 
I 273 X 42350 2 (or just a trifle less). 


See Walker's Introduction to Physical Chemistry, 2nd edition, p. 30, or any similar work. 
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physico-chemical system of units ¢, = 3 for a monatomic gas ; for the ordinary 
diatomic gas c, = 5; and for more complicated gases the values are 6, 7, 8 or 
numbers in the immediate vicinity of these integers.* Hence there is the 
equation 
r “yr ‘ 
U=c,T=nT, n=3, 5, 6, 7, 8, 


for the energy of any ordinary simple perfect gas. The entropy and free 
energy of the gas have the familiar forms 


S=c,log 7+ 2logV+ const. and yp =—2TlogV—c, T log T; 


and finally, the thermodynamic surface for the gas, which is the equation 
between the energy, entropy and volume, is 


Us V? = We’, Han arbitrary const. 


With these data for gases in mind it is easy to set up an analogy between 
the thermodynamics of a perfect gas and the averages obtained over a canoni- 
cally distributed ensemble of vibrating dinner-gongs. The results may be 
exhibited most graphically in double columns. ; 


Thermodynamics of a Statistical Mechanics of the 

Perfect Gas Dinner-gong 
temperature 7’ constant (modulus) @ 
energy U average energy € 


negative average index 


entropy SN : ies — 
P of probability ” 

free energy constant 

volume JV" external coordinate v « 7/3 


average external generalized 


oy a“ 
wressure p= — (—, , ° 
l (; ) r force A= — (< 


ov Je 
specific heat c, degrees of freedom n 
equation pV = 27't equation Av = 20 


* These numbers are of course the ‘degrees of freedom” of the molecule. Chlorine 
is a somewhat anomolous diatomic gas with c¢, = 6. Ethylene givec, = 8. 
t This is the so-called characteristic equation of Boyle for perfect gases. 
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equation U' = c¢,T equation € = »@ 

equation y = — 27 log V —¢,T log T equation y = — 20 logv — n@ log @ 

equation S = ¢,log7' + 2logi equation — y = nlog® + 2logu +n 
+ const.*® 

equation UV? = Ke*t equation e"7? = hk e-” 


It therefore appears that the statistical mechanics of the system contain- 
ing x gongs has all the formal properties of the thermodynamics of a perfect 
gas except that the arbitrary constant which enters into the entropy of a gas 
(and consequently into certain other formulas) is determined in the dynamical 
system. It may be well to note in particular some of the correspondences. 
The volume of the gas is related with the fourth root of the length of the up- 
permost string. If the volume is increased, the gas does work and loses 
energy; if the string is lengthened energy is lost to the system. To the 
specific heat corresponds the number of gongs, that is, the number of degrees 
of freedom in the system—and this is precisely the number of degrees of 
freedom assigned to the molecules of the gas. As the above equations 
contain the basis of all the thermodynamics of the gas and have identical 
analogs in the averages taken over the canonically distributed ensemble of 
dynamical systems, it appears that the development of the two could never 
diverge. This conclusion, however, is a bit hasty for the reason that nothing 
has been developed which is analogous to the mixing of two gases. As a matter 
of fact the mixing of gases depends on a principle in addition to those here 
given for a simple gas and contains a somewhat troublesome paradox known 
as Gibbs’s paradox. The matter of mixing gases and the analogous combina- 
tions of dynamical systems will therefore be omitted from the present 
discussion. 

There is one point of general import which should be mentioned. The 
reason for replacing the study of one particular vibrating system of gongs by 
the study of average values over an ensemble was that nothing was supposed 
to be known concerning the initial conditions of any one system (page 








* The entropy necessarily contains an additive constant as it is defined by an integral for 
which there is no natural lower limit — the absolute zero of temperature being excluded because 
the integral diverges for T=0. Similarly y = U — TS contains an arbitrary constant and may 
be written y = — 2T log V — ¢,T log T + (ee — const.) T; but the origin of entropy may be so 
chose. as to make this final term vanish. The constant n which occurs in — 4 is not arbitrary. 

+ The equation of the thermodynamic surface. The constant X is arbitrary owing to 
he additive constant in 8S; but k=e-". 
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137-8) ; the only known fact was the total energy of the system and the 
differential equations of motion immediately derivable therefrom. The ques- 
tion naturally arises: How nearly do the average values represent the condi- 
tions likely to obtain in a system selected at random? This is quite like 
the analogous question in the theory of errors of observation: How re- 
liable is the result as determined by the theory’ A measure of the 
reliability of the result is assumed to be found in the value of the mean square 
error; and for the dynamical systems the average value of the square of the 
departure e — € of the energy from its mean value may naturally be taken as 
a measure of how far the average state differs from the state of a system 
selected at random. It remains, therefore to evaluate the average of (€ — €)?. 
Now it is clear that the relations 





ro 
! 


=— 2e€ + € 


™. 
te 


(e —e)? =e — Yee +e= 


» 
| 
t 


hold for the averages. It is therefore necessary to evaluate & ; and the device of 
differentiating under the sign of integration may again be advantageously em- 
ploved. By definition and by (5) we have 


‘ ‘+ © , Sand. 
€ =|. ‘ [ec os dp, te - dys = 50. 


Now differentiate with respect to © on both sides. Then 


“+ on r ike l C - ah - 
[- | - € [“o- ~-. ~ e? dp. +--+ dq, =3. 


"+ @ w-e "4 x ; 29 
or |: a [ee al dp, >» * dis om Y |. ° - fee a dp, diy; 


“+ @ 
+ (-) +3 [few © o dp, ss 2 dq; — 30°. 
: ey - , 
or iss, & - = 30°. 
) efj—wwpet+ =e € 


The substitution from (5) and (3) or from (5') or (3') gives 


= 126° or n(n+1)8@?; @-—-@=30" or 10’, 


— a Coe - — 
= a2 Ate et ee a Oris 
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according as the number of gongs is 3 or n. Hence the ratio 
(e—e)?: @=1:3 or I:n. 


It therefore appears that as far as the energy is concerned, the mean square error 
in assuming that the average represents a system selected at random is consider- 
able when x is small but not very large when x» is great and becomes quite 
negligible when n is very great. A similar calculation could be carried out tor 
the quantities 

(n— 7)? and (A — A)?; 
but this will be left to the reader. 

A simple device will give a system in which the average condition better 
represents the condition probably found in a system selected at random. For 
suppose that instead of considering single sets of n gongs, the attention is 
turned toward pairs of exactly similar sets or in general to a system consist- 
ing of V of the single systems of gongs. This compound system will have Wn 
degrees of freedom if each individual set has n gongs. The energy of the 
compound system will be 


N N 
€e= > € = 7 > 
1 1 


[ avi + Doi + Px to + 8G + (Gai — Mi)? + (431 — Fai)? + - ‘|. 


If the compound systems are to be canonically distributed over their repre- 


u-e 

sentative space of 2.Vn dimensions, the probability will bee ® where ¢ has 
this new value. As there are no relations between the different sets of coor- 
dinates and momenta 7);, P2;, Psis - * ++ Vitis Yair Yuin - *y it is readily seen that 


the quantities connected with the compound systems are 


y=—2NOlogv — Nn Olog@ 


~ 6p — « 
= =— 2Nlogv — Nn log @ — Nn 
e=yY — 067n = NrO 

- e ,@ - 

A=— ~~ = 2 >? Av = 2N@0 





(e—e€)?: @=1: Nn 
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By taking XV sufficiently large, the probability of the energy of an arbitrarily 
selected system being noticeably at variance with the average energy is very 
small. Moreover the only difference between the new and the old equations is 
that y, 7, €, A, are multiplied by the constant V — a difference which is of 
the same sort as if these quantities were measured in different units. The 
thermodynamic analogy therefore persists in all its essentials. It-even has an 
additional advantage in that now any one system represents the average or vice 
versa with great probable precision, which is surely true of any particular 
sample of gas and the average of all similar samples. 

3. Microcanonical ensembles. <A _ microcanonical distribution is 
derived from the canonical by confining the attention to those systems of the 
ensemble which are characterized by representative points between the surfaces 
e= Cande + de = C + dC in the representative space. Now as the proba- 
bility depends only on the energy and the energy is sensibly constant between 
these two surfaces, it follows that the actual infinitesimal probability (see page 
134) that the representative point shall lie between the two successive 


surfaces is 
Vj —_? iV) * 


PdV=e9* dV=e o ) de, 


. 


where V represents the volume of the six-dimensional space included by the 
surface « = C’ —a quadric surface in pj, P2, P35 J15 J2. Jz, —and dV represents 
the increment of volume between this surface and the succeeding one. 

It is necessary to calculate the volume 


eC 
v=: : -f dpvipslpatastaste 
«= 0 
within the ellipsoid « = C, where for definiteness and simplicity the problem 
has been restricted to the case of the single system consisting of only three 
gongs which is the case tirst handled in the previous discussion. ‘The easiest 
way of evaluating V’ is to make the change of variable 


x; =Vra, py, rg = Viriz Po, rs = V7ra3))3, Y= Vrv-2Q, % = VT( G2 —N) 


— vy? 
%=VT(73— 92), Iy--- d= da, + +» drt 





*As a system with a definite length for the uppermost string is under discussion, 
the quantity v is supposed to be constant at this point; later it will be allowed to vary. 

+ It may be recalled that to find the new element of volume when changing the variable 
in a multiple integral the Jacobian is introduced. 
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The integral to be e raluated then takes the form 


oe, a 
v= — |. > | doydrglegledheglite 


’ 


where ait a+ apt apt opt re SC. 


This is a classic integral evaluable by means of the [T— functions. 


* 


result is 


v= a - biidu [ Wedh = } (3,2 = } er?, 
7m F(S) Io , 


Now the expression for the probability may be written 


u-e ] 2 
PdiV=e°e . 5 A de; 


= 


and may be given a more convenient form by letting 


dV 
= log ( ) = 2loge — 2logr — log2, 
. 


de 


ye 


so that PAV=e% ** de and | eo ** dex il, 
0 





the last condition being the new form of (2)- 
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The 


(7) 


Now a second thermodynamic analogy may be set up. This time the 
analogy is between the quantities arising in the theory of a perfect gas and 
the average taken over a microcanonical ensemble. Equation (7) may be 


written as 


ey? = keloe V and compared with UsvV? = KeS. 


It is then seen that the following are analogical : 


In the Ensemble In the Gas 
the energy e the energy U 
the quantity v the volume V 
the quantity log V” the entropy S 








*See Todhunter’s Jntegral Calculus, p. 263, article 278. 
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The list may be further extended by the use of the fundamental thermody- 


namic formula 
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de = dH — pdV = TdS — pd V 


compared with the result of differentiating «, namely, 


Ge . Ce : : Ce 
de = (ser) d log J - (; ) dy = e-* Vd log J + (; ) dv. 
dlog | v ov log V OVS log V 


Then the following correspondences may be set up: 


In the Ensemble 


Ce 
a. a es 2 
(ser “Ngee he 


5.) 2e 
ov log V bv 
Kquation Av = 2e-* V 


There seems to be no need to carry the table 


have been found for the system with x gongs. 


Equation pV = 27 


In the Gus 


the temperature 7’ 


the pressure p 


. ee : 
further. Similar results could 


This second analogy has the distinct advantage that the energy of the 


dynamical system corresponds to the energy of the gas, whereas in the earlier 


analogy it was the average energy over the whole ensemble which corresponded 


to the energy of the gas. 


This is not the place and the necessary fine dis- 


tinctions between closely related ideas in the theory of probability have not 


been sufficiently set forth to enter profitably into the discussion of the relative 


merits of the two analogies. 


On the whole, physicists seem to prefer the 


second, apparently mainly for the reason that the correspondence of the 
energies seems to them more natural; but Gibbs evidently inclined rather to 
p efer the first and to feel that (in view perhaps of the facts that for systems of 


so many degrees of freedom as a gas the energy of any one system could 


scarcely be expected to differ appreciably from the average energy and that 


we really do not know what the energy of the gas as a dynamical system is 
except in so far as it makes itself felt through the sort of average phenomena 
of pressure and temperature) the first correspondence was quite satisfactory ; 


there are, moreover, some other reasons why it is more satisfactory. 
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Conclusion. The foregoing simple dynamical system in which the com- 
putations of finding averages have been so easy as to be carried out without 
difficulty and for which the thermodynamic analogies are immediately compar- 
able with the ordinary theory of gases has been treated as far as possible in 
parallel with the gen-ral discussion of statistical mechanics given by J. 
Willard Gibbs in his Elementary Principles in Statistical Mechanics. That 
the simple and concrete case may perhaps be useful as an introduction to the 
general and abstract presentation of the statistical mechanics of any system which 
satisfies the Hamiltonian equations of motion is its only excuse for existence. 
There are, however, a number of matters which suggest themselves in connec- 
tion with the foregoing work and with statistical mechanics in general which 
may descrve a moment's notice. 

In the first place it might be considered remarkable that a system so 
simple and so apparently unlike a gas should give the thermodynamical equa- 
tions of a gas. <A reference to Gibbs's work will show, however, that the 
fundamental differential equation of thermodynamics and the fundamental 
equation of statistical mechanics 


dU = TdS—plV and de =@d(—n)— AdV or de = e~* Vd log V — Adv, 


whether in the canonical or microcanonical ensemble, will always correspond 
when there is only one variable external coordinate in the system. — It there- 
fore only remains to select the dynamical system in such a way as to give the 
proper special results fora gas, namely, the characteristic equation, the free 
energy, and soon. The selection was easy, especially in view of the general 
law of equipartition of kinetic energy. 

A few words may be said in explanation of this famous law propounded 
by Maxwell and discussed by practically every student of the theory of gases 
since that time. In the case in hand, the kinetic energy is a sum 
T(O) Pi + Az py + Gy p;) Of the squares of the momenta and it is clear that the 
individual terms re, pj, Td, p3, 77,0; represent the kinetic energies of the in- 
dividual gongs. The average value of these partial kinetic energies is easily 
found. For instance 
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+m v — ra,p Stn P2—% 
| Ta, pj 2 pra, p;? dp, f fe dp n+ * dq 
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| e7P? dp, fi fe dp, e @ ¢ dqz 
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Now the integrals though properly iterated, may be considered as merely 
multiplied : for p, does not occur in the last five provided the limits are not 
taken as functions of p,. Hence the last five integrals may be canceled from 
numerator and denominator and the calculation of the simple integrals remain- 
ing gives 





api = 3 @. 
It therefore is seen that the average of each partial kinetic energy correspond- 
ing to each individual gong is 4 © and the average value of the total kinetic 
energy is 430 or 4n@. The fact that each gong has the same average kinetic 
energy as every other constitutes the law of equipartition of kinetic energy 
for this particular case. With the proper conventions the same result is ob- 
tained for any dynamical system. 

The proof that the average kinetic energy in a system of n degrees of 
freedom is 4 x © and that each degree of freedom has _ its aliquot share of the 
whole depends, as Gibbs gives it, on the assumption of the canonical distribu- 
tion including the hypothesis that the momenta may have any values 
between — o and + » and that the limits of the integrations with respect to 
the coordinates 9, 72, +--+, 7, do not depend on the momenta. If any of 
these assumptions were untrue, the proof would not hold. The assumptions 
seem ideally simple with the exception of the initial assumption of canonical 
distribution ; and at any rate the fact of the at least approximate truth of the 
law of equipartition of energy is so strikingly contirmed in the theory of gases 
as to justify the detailed discussion of any reasonable theory which has it as a 
consequence. This will perhaps serve as an answer to those who might inquire 
why the particular law of density or probability for the ensemble known as the 
canonical law was chosen for the work of this article; there are moreover 
humerous more or less accurate and plausible “proofs” of the law or reasons 
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for it which may be found in various treatises on the theory of gases and in 
Gibbs's book ; but they cannot be taken up here. Other assumptions for the 
density or probability would have enabled us to compute averages but would 
in all probability have been destitute of thermodynamic analogy. 

Although the general theory of statistical mechanics reached a high de- 
velopment at the hands of Gibbs, the detailed application of the method of 
investigation to systems that are of thermodynamic importance in physics has 
scarcely been touched; special methods and approximations for obtaining 
averages, under the initial assumption not of the canonical distribution but of 
the law of equipartition of energy, have generally been employed. Indeed it 
is readily seen that to evaluate an iterated integral like 


[+ [apr «apy fo feo dy «+ 


J-« 
(even where the limits for the integrations with respect to the q’s are taken 
as independent of the p’s) is out of the question for most sets of limits and 
for most assumptions for the potential energy e, even if the simple assumption 
be made that the kinetic energy consists of a sum 2a,p? with constant coeffi- 
cients. For instance if x, y;, 2; and x;, y;, 2; are the coordinates of the 
centers of two monatomic molecules of a gas, and if the usual law of attrac- 
tion inversely as the square of the distance and repulsion as the inverse fifth 
power of the distance be assumed, the potential energy would take the form 
€, os 2 | - : bas . = : Pa ; 
Ty LV (4 — 4)? + (Yi -— ¥))? + (i — 2)? 


4 
(4 - z;)? + (yi — 





yj)? + (%— ayath 
to which other terms must be added to keep the molecules within the vessel 
containing the gas. 

Despite the difficulties which seem to lie in the way of the systematic 
practical application of the statistical mechanics of a canonical distribution of 
dynamical systems, the fact that the theory leads to the fundamental differential 
relation of thermodynamics will probably lead to its detailed investigation. It 
is hoped that the presentation here given may be an aid to those who wish to 
inake a start on this field of science. 


MASSACHUSETTS INSTITUTE OF TECHNOLOGY, 
Marca, 1909. 











DISCUSSION OF A METHOD FOR FINDING NUMERICAL 
SQUARE ROOTS 


By C. L. Bouton 


In a recent article* Professor E. R. Hedrick gave a method for approxi- 
mating to the positive square root of any positive number. Inasmuch as 
he merely states the method without giving proofs, and as the proof itself is a 
rather pretty exercise in the elementary theory of limits, it seems worth while 
to give a discussion of the method. The facts that the method is really 
Newton’s method of approximating to the square root, and that it may easily 
be extended to the p™ root, are of interest. Professor Hedrick’s plea that 
this method be substituted for the method of extracting square root ordinarily 
used seems to be scarcely justified on the ground of rapidity of convergence, 
although there may be pedagogical advantages in its use with young pupils. 
The method has the further disadvantage that when applied to a number which 
is a perfect square, it yields the square root only as the result of an infinite 
process, unless indeed the /rs¢ trial root should turn out to be the exact root ; 
whereas, the ordinary process applied to this case always terminates in a finite 
number of steps, provided that the given number is an integer, or a finite 
decimal fraction. 

The process is the following: Let @ be the given number, select any posi- 
tive number x, and form in succession 


l a 
ny, => 5) (» + i) 


1 a 
ig == 3 (» + . , 
(1) Qe aN 


| 1 a 
n= 5 (m1 + —~) : 


Then we shall prove that the numbers of the sequence 7), ”, ++ +5 %, 
form a decreasing set, and approach Va as their limit. The process converges 





* Western Journal of Education, vol. I, October, 1908, p. 252. 
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rapidly enough to be useful in computing numerical square roots. For ex- 
ample, let @ = 2, and choose n = 1. Then we compute in succession 
m= 1.5, ng = 1.417, nz = 1.414216 
Thus in three steps the error is about 2 in 10°. 
To prove that limit (»,) = V7, and to estimate the error at any step, let 
i= 


us proceed as follows: First we have the identity 


a\? ni_y—ay? | 

(2) n~—-a= 4 (m1 a ——) —-az= (—*) = 0, 

whence 

(3) n; = Va (i=1,2,---). 


That is, 7, 73, --- are all greater than Va, unless n was chosen equal to Ya, 
in which case ny = 2 =--- =a. From this point on we shall assume that 
n was not chosen equal to \a, so that in (2) and (3) the equality sign cannot 
hold. Moreover, from (1) and (3) we have 


a iL , 

(4) N;_, — n; = ————- > 0 (t= 2,3,---), 
2; 

so that 

(5) Ne < Nj ((= - 3, see). 


Therefore the variable 7; always decreases when ¢ increases, but n, is al- 
ways greater than Vv. Hence by a fundamental principle in the theory of 
limits * the limit of n; exists when ¢= x ; let us call it VV. We readily see 
that this limit is ¥, for from (2) and (4) we have 


(6) n—-Ya = Ce ses *\ = (mj —1 — mi)" (¢=1,2,---). 
n 


; ? 
n+ Va oni} \a 


Passing to the limit for i = x , we get 


N-ya= i <2 a" 
iV + Va 
N = Va. 


* See, for example, Osgood, Introduction to Infinite Series, p. 4. 














1909] METHOD FOR FINDING NUMERICAL SQUARE ROOTS 169 


This proves the convergence of the process. It is easy to set limits to 
the error at any step, and to form some idea of the rapidity of the convergence. 
We have from (6) and (3) 


(4714 — - n;)° 


(7) Se... “<n—vVa< (Mi-1 a ni)? < ni(Mi—1 — 7)? 
2n; ’ 


2Va 2a 





That is, if we set n;_,;-—7;= 6; and n;— Va =e,, then 


| 8; no; . 
(8) e+ on (:=1,2,-.--). 


Furthermore, from (7) and (3) 





(9) ni—Va i (n;_; — 7;)? oe ie = va’ 
Va 2a <9 Va 
Let uo = r;,, 80 that r; is the relative or percentage error at the 7" step. 
a 
Then 
(10) nr <4 ry (§=2,3,---). 


Repeated application of (10) shows that 


(11) a ath ++} 





n-a 


Formula (8) shows that the actual error of any approximation is nearly 
proportional to the square of the difference between that approximation and 
the preceding one. Formula (10) gives the relation between the relative 
error at any step, and that of the preceding step. It shows that when working 
decimally the number of significant figures retained at any step should be 
double the number which are correct at the preceding step. These formulae 
both show that while the process may not converge very rapidly at first, it 
later converges with extreme rapidity. Thus in the computation for V2 we 
found r; < 2 x 10~®, hence r, < 2 X 10-" andr; <2 x 10-*. 

Formula (11) enables us to estimate the number of steps necessary to 
obtain a given degree of approximation, and furthermore shows the effect of 
the choice of the initial n. Of course the nearer n is chosen to Va the more 
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rapid is the convergence. The method is therefore particularly useful for 
computing square roots when an approximate value is known, as for example, 
in extending a given table of square roots to a larger number of places. Thus, 
if we take a = 3, and choose n = 1.732, a single step gives n, = 1.732050808314. 
If to this we apply the correction computed from (8) we find 1.732050807569, 
while the value of V3 correct to thirteen decimal places is 1.7320508075689. 

Furthermore, it should be noticed that the commission of an arithmetical 
error at any stage of the work does not vitiate the ultimate result, for it 
simply amounts to starting over with a new value of ”. In fact, such a mis- 
take may even increase the rapidity of the convergence. 

A geometrical proof of the convergency of the process is easily given, 
Draw the rectangular hyperbola whose equation is 


LY = dad. 


On this hyperbola connect the points (». ") and (“. n). Let C, be the 
uu it 

















¥ 
Bk(¢.*) 
B, Cc, 
B) c, 
Cc, 
mmN 
A, (n,¢ 
a | 
0 % a, nn n x 
FiG. 1. 


middle point of this chord. It is easy to see that the coordinates of C, are 
(n,, n,). Then repeat the process with 7, in place of n, and so on indefinitely. 
An examination of the figure shows at once that the points C,, C2, Cy --- 
are moving down the diagonal towards the vertex, but that they never go 
beyond the vertex. Hence the limit of n; is Va. This figure further shows that 
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the value of x assumed at the start has no influence on the final result, but 
that if a value of n near Va be chosen the process will converge much more 
rapidly than if we begin with a value remote from Va. 

It is interesting to apply this geometrical method to trying to find the 
square root of a negative number. The process does not converge. 

Finally, it should be noticed that this process is exactly that given by 
Newton’s method for approximating to the positive root of the equation 


x -—-a=0. 
In that method, the curve 


a 
- 


Y= — a 


is drawn, any value of x is selected, the tangent is drawn at the corresponding 








(ma) 





Fig. 2. 


point of the curve, and its intersection with the X-axis gives a new value of 
x, with which the process is repeated. In this particular case the equation of 
the tangent at (n, n? — a) is 
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y- (i? —-a)y= 2n(x — n) 


or 2nx=ytn ita 


whence n 


» 


1 a - Ps 
= = (» + =) , if n, be the X-intercept ; 


and the process is repeated. But this is exactly the method described above. 
Newton’s method applied to the p™ root gives at once 


| ==(¢ 1 ~ 
~"% = za} 


(12) 
1 . a 
Rn, *- [ — 1)n;_, + == |. 
If we write 
. _ €; 
N;-, — nj = Oo, Yu— Ya = “is : —- =f; 
Va 
it is easy to prove from (12) that 
Og A 
2n; — 2[n; —(p — 1)8) . 
d ° P mir l m p—2 


These formulae are direct generalizations of (8) and (10), for the actual 
and relative error of any approximation. 


HaRVARD UNIVERSITY, 


CAMBRIDGE, Mass. 











ON THE DIRECT PRODUCT IN THE THEORY OF FINITE GROUPS* 
By J. H. Mactacan-WEpDpERBURN 


1, Tuts paper is an investigation into the formation of groups by the 
process known as the “direct product.” The calculus introduced by Frobenius 
is used throughout with the addition of the sign A ~ B to denote the largest 
sub-complex which is common to the complexes A and B; simple isomor- 
phism between A and B is denoted by A ~ B. 

Groups are represented by Gothic letters and their elements by the cor- 
responding Roman capitals; ordinary numbers are represented by small 
Roman letters, and in particular the order of a group is denoted by the same 
letter of the alphabet as the group itself. Thus the elements of a group A are 
denoted by A,, A;, --- A,, a being its order. The identity is denoted ag 
usual by J. { 


2. If a group H has two sub-groups A and B such that (1) H = AB, (2) 
A ~ B= J, (3) every element of A is commutative with every element of B, 
then H is said tobe the direct product of A and B, and A and Bare called direct 
factors of H. 

The relation between H, A and B is convenientlyt denoted by 
H=A x B. Wherethe meaning is otherwise clear, the adjective “direct” may 
be omitted and A and B simply called factors of H. In this paper the term 
“factor” will be used solely in this sense. A group whose only factors are 
itself and the identity is said to be prime. 

It may be noticed here that, instead of saying that every element of A 
is commutative with every element of B, it is sufficient to state that A and B 
are invariant. t 

3. If Gis any subgroup of H = A x B, every element of G can be ex- 
pressed in the form G, = A, B,. The two sets A, and L, (r=1, 2, +--+ 9) 
form subgroups of A and B respectively which are isomorphic with G. These 





* Read before the American Mathematical Society, September 10, 1908. 

+ Cf. Hurwitz, Mathematische Annalen, vol. 45 (1894), p. 389. 

tCf. Dyck, Mathematische Annalen, vol. 22 (1883), p. 97, and Burnside, Theory of 
Groups of Finite Order (1897), p. 44. 
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subgroups, which are of constant occurrence, are conveniently denoted by A, 
and B,. These groups are, of course, only defined with reference to some 
particular method of breaking up H into factors. Thus if H = A X B=A x C, 
the first factorisation gives A, = J, while this is not the case when the second 
factorisation is taken, unless B= C, which is by no means necessarily the 
case. 

TueoremM 1. JfG isa non-abelian invariant sub-group of A X B, either 
AAG>TIorB~G>ol. 

If possible lt A~G=Z=B-~G. Then evidently A, ~ G ~ B,. 
Since A and G are invariant and A ~ G= J, every element of A is commutative 
with every element of G. Hence A, is abelian: for if G = A, Band Ay is 
any element of A, then A, A, B = A, G = GA, = A, BA, = A, A,B, and 
therefore A,4, = A,A;. But, since G ~ A, and G is not abelian, this is 
impossible ; hence the theorem is valid. 

THeoreM 2. JFAXB=AxXC, thenB~ C. 

For B and C are both simply isomorphic with the quotient group of 
A X B with respect to A. We may also notice here that B. = B, since A and C 
are both contained in AB.. 

If BC = B x A’ where A’ S A, it is easily shown, as in the proof of the 
last theorem, that A’ is a sub-group of the central of A. If then the central 
of A reduces to the identity, we have B = C. 


4. If H=A, X A, X--- X A, is an expression of H as the product 
of prime factors, A,, Aj, --- A, is said to be a prime composition series of H. 
Two prime composition series are equivalent if they have the same number 
of terms and if their terms can be so arranged that corresponding ones are 
simply isomorphic. 

THEOREM 3. <Any to prime series of a group are equivalent; and the 
correspondence of the terms can be so arranged that A, = Aand B,=B,if A 
and B ure corresponding terms.* 

Assume that the theorem is true for groups whose order is less than the 
order of the given group H, and let A, A, -- - A, and B,, B,, --- B, be two 
prime series for H. 





* This theorem is of course well known in the case of abelian groups. Incase of non- 
abelian groups it is implied in the known theorem that a direct product may be represented 
as a product of substitution groups written in distinct sets of letters. Cf. Miller, Transac- 
tions of the Amer. Math. Society, vol. 1 (1900), p. 71. 
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Let the factors be so arranged that a, = }, and consider the group 


G=A, (B, X--- x B,) 
= Bi x B,- -- x B,, 
where B{ = B). 
If G = H, then Bj = B,, and, since a, = },, we must have 


H =A, X B, X --: X B,. 


Theorem 2 then gives A, ~ B, and A, X --- X A, ~ B, X --- X B,, whence 
the first part of the theorem holds for H by hypothesis. The second part of 
the theorem obviously holds as regards A, and B,; it can be shown to hold 
for the other factors as follows. 

Let C=A, X--: x A, and D=B, x --- X B,. The relations given 
above then become 


H=A, X C=B, x D=A,; xD 
and p=q, c=d. 


Now AC, C,,- +: Cy contains A,, B;, - - -, B,and is therefore equal to H, 
hence C5, oes Cy =C. But c = d=h,b,.-.- b, and hence C = Cy, X Cy, 
xX +++ C,, thus giving a prime series for C. C, is therefore by hypo- 


thesis simply isomorphic with say A, and(A,)| = A,. But B, < A,C,, and 
b 
thence (A,)_ = (A,),. The second part of the theorem is therefore also true 
b r 


for H. 

Suppose now that G< H. It follows by hypothesis that A, being a 
prime factor of G is simply isomorphic with B, for some value of s > 1 
and that (Ai), = Ai (B,)a, = Bs, or else that A, ~ B}. 

If A, ~ Bi, we have A, ~ B, x --- X B, = J, since (Bj), = Bi, and 
therefore 


H B, 


A, Ar Xk + X Ap= FX BX: Xx Be 
1 


Hence, by hypothesis, to any factor B, (s > 1) there corresponds a factor A, 7 
satisfying the conditions laid down in the theorem, viz. : 


A, ~ B,, (As), = As and (B,) =B.. 
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Putting s = 1, for convenience, this gives immediately 
A; X A: X *e-* xX Ap = B, X Az X iti x A, = B, X B, X ++ & By, 


and hence as above we find that the theorem holds for H. 

The theorem is obviously true when the order / of H is unity and henee, 
by induction, it is true for any group. 

In some cases the factors of a group are unique. This is so, for in- 
stance, when their orders are relatively prime, or more generally if the 
orders of their centrals are relatively prime. The following theorem, along 


with the corollaries to theorem 2, shows the connection between the factors 
when they are not unique. 


TueoreM 4. Jf A and B are any two factors of H, A being prime, 
then if AN B>T, B has a factor which is simply isomorphic with A. 
Since B is a factor, AB=C xX B, and if A~B>J, then c<a. 


Hence C is not simply isomorphic with A, and therefore, by the previous 
theorem, B has a factor which is simply isomorphic with A. 


Tue UNIVERSITY OF 
EDINBURGH. 








EXISTENCE OF THE GENERALIZED GREEN’S FUNCTION 
By W. D. A. WEsTFALL 


In his second paper on the theory of linear integral equations,* Hilbert 
has established a relation between these equations and the boundary problems 
of ordinary, self-adjoined differential equations in the following manner: 

If in 


L(u) = 5 [re G] +1 


p(s) is continuous, together with its first derivative, and ¢(s) is continuous 
fora Ss 33, then 


(1) [eben - uL(r) as = [pew - wy), 


provided u and v are continuous with their first and second derivatives. This 
is Green’s formula for one dimension. A set of two linear, homogeneous 
boundary conditions that involve the vanishing of 


[ p(ow —_ w') |. 


for any two functions u and v, each satisfying these boundary conditions, is a 
set of Green's boundary conditions. 
A continuous solution of 


L(u) + ru =0, 


with a continuous first derivative, and satisfying moreover a set of Green’s 
boundary conditions, is called a characteristic solution. A continuous solu- 
tion G(s, t) of Z(u) = 0, satisfying the boundary conditions, but having at 
8 = ¢ the following discontinuity in its first derivative : 





. dG . [dG 1 
_ 7 ee ds s=at—e p(t)’ 





* Gottinger Nachrichten, 1904. 


(177) 
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is called a Green’s function. It follows from (1) that if u;, u; are character- 
istic solutions of 
L(u;) > Aj u= 0, 


b 
(2) [ue u;(s) ds = 0. 


With the Green’s function ((s, ¢) they satisfy the relation 


(3) uj(8) = XA; | G(s, t) u,(t)dt. 


1 

Moreover any function satisfying (3) with a Green's function G(s, ¢) is such 
a characteristic solution of Z(u) + Au= 0. If the Green’s function exists, 
the relation between integral equations and the boundary problems of differ- 
ential equations is thus established. 

If p(s) is positive ina Ss =4, it is known that either a Green’s func- 
tion or a characteristic solution of (1) = 0 exists, but not both.* In case 
a characteristic solution exists, in which case there is no ordinary Green’s 
function, Hilbert has defined a generalized Green’s function //(s, ¢), which 
replaces G in (3). If @,, @, constitute a complete set of linearly independent 
solutions of L(u) = 0, normalized and orthogonalized, that is, such that 


b "db “b 
[ w,o,ds = 0, | wids = 1, | oids =1, 
a a Ja 


then the generalized Green’s function is the Green’s solution J/(s, ¢) of 


(4) L(u) = (8) @,(t) + @:(8) w, (t),t 
for which 

b 
(5) [4 t) w,(t) dt= 0, n=l, 2. 


There follow in the article referred to certain examples of this generalized 
Green’s function, but no attempt is made to prove its existence in general. It 
is the purpose of this note to supply this proof. 





* The writer's dissertation, Géttingen, 1905, p. 20. 


t Hilbert considers only the case where only a single characteristic solution exists 
for \=0. 
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Let the two linearly independent solutions of L(u) + Au = 0 be z,(A, 8) 
and z,(A, s). These are analytic in A.* If a characteristic solution exists for 
all values of A it is given by 


C2 (A, 8) + CzQ(A, 8) = U(A, 8), 


where c, and c, are so determined that the boundary conditions are satisfied. 
Hence such a solution is analytic in A. From (2) 


b 
[ro 8) u(A+h, s) ds=0, 
a 
or, by the theorem of the mean, 


b b 4 » 
[vo. 8) }? ds + if u(r, 8) HO — Jd=0, 0<60<1, 





which is impossible for h sufficiently small, unless u(A, s) = 0. 

There exists then for some value of A, as AX = p, no characteristic solu- 
tion, but a Green’s function G(p; s, ¢), since L(u) + pu =0 is an equation 
of the same type as L(u) = 0, where g + p replaces 7. Let 

; 1 
E(ps 8,0) = Gps 0) + = [an (8) a (t) + 0 (8) & (0) 
FE is a Green’s function of 
L(u) + pu = a, (s) w(t) + @(8) @(¢), 


such that 
b 
(6) [ze 8, t) @,(8) ds = 0, n=1, 2. 


From the theory of linear integral equations we knowt that there is a 
continuous solution A(s, ¢) of 


(7) E(p;8,t) = K(s, + ef Elis, r) A(r, t) dr, 





* Picard: Traité d Analyse, vol. 2, p. 92. 
tI. Fredholm: Acta Mathematica, vol. 27 (1903). 
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or a not-identically vanishing solution of 


b 
$(s)=—p [2@ 8, t) P(t) dt. 


From (6) a solution of this latter equation is orthogonal to , and @,, that is 


b 
(8) [ons $(s) ds= 0, nal, 2. 
a 

It satisfies, moreover, all linear homogeneous boundary conditions satisfied by 
E in s, and is a solution of Z(u) = 0, as may be seen by direct substitution. 
It must then be a characteristic solution of L(u) =0. But equation (8) 
shows that such a function ¢ cannot exist, since @, and w, constitute a complete 
set of linearly independent characteristic solutions. There exists then a con- 
tinuous solution A(s, ¢) of (7). It is seen immediately that this is the gen- 
eralized Green’s function sought. 

It has been assumed in the above proof that two linearly independent 
characteristic solutions of L(u) = 0 exist. Neglecting the case where only 
one exists has affected the proof only in simplifying the notation. The follow- 
ing theorem has been shown: 

If p(s) is continuous together with its first derivative, and positive, and 
if q(8) ts continuous in a given closed interval, there exists for each set of 
Green’s boundary conditions for this interval either a Green’s solution or a 
generalized Green’s solution of L(u) = 0. 

This proof, with a mere change in notation, holds for self-adjoined linear 
differential equations of the 2nth order. 


CoLtumBia, Mo., 
FEBRUARY, 1909. 

















THE GAMBLER’S RUIN 
By J. L. Coonipce 


THE various problems which are connected with games of chance have 
interested mathematicians ever since the time of Pascal. In particular, the 
probability that when two men are playing together at a certain game, the 
one will end by ruining the other, and the probable number of turns before 
ore player’s ruin, has been studied by James Bernouilli, DeMoivre, Lagrange, 
Laplace, and others.* In all of these works, however, it is assumed that the 
amount of the stakes is the same at each turn, and the problem of showing 
that, regardless of the amount which is staked, the probability of ruin for the 
one or the other player, lies within certain definite limits, has not, so far as I 
have been able to discover, ever been undertaken. In general the mathemati- 
cian assumes, without any proof, that whoever indulges in a game favorable 
to his adversary has a very strong chance of being ruined, regardless of what 
system of play he may affect. The player, on the other hand, ascribing to 
the goddess of fortune a desire which she does not possess, to atone for past 
errors, works out a system of play in which he confides. It is the object of 
the present paper to show that no such system can have more than an easily 
calculable chance of success. 


1. Unlimited Stakes. Let us imagine that two persons called re- 
spectively the “player” and the “banker” are playing at a game of chance. The 
effective difference between the two is that the player has the right to settle 
the amount of the stake each time; and also to leave off when he chooses. 
Suppose, first, that the player has a capital which, compared with the banker's, 
may be roughly called indefinitely great. Such is, in fact, approximately 
the case when the banker conducts a public gaming establishment ready to 
meet all comers. The probability that the player will be ruined is, here, 
negligible. On the other hand if the player begin by staking the amount of 
the banker's fortune and doubles his stake every time he loses, the probability 
that he will end by ruining the banker, is very large indeed. For that reason 
public gaming establishments habitually set an upper limit to the amount which 


*See Todhunter’s History of the Theory of Probability, London, 1865. 
(181) 
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may be staked. Let us, secondly, suppose that whereas the player has only a 
fortune of m the banker’s fortune is quasi-infinite in comparison therewith. 
Let us say that a first game is finished when either the player is ruined, or he 
has doubled his original fortune. As in this game his stake is surely less 
than 2m we are under the second case of limited stakes, to be treated pres- 
ently, and the chance that the player will escape ruin is not over one-half ; 
unless the game is favorable to him, as is not the case in public gaming. If 
the player win the first game, let him start another which shall terminate when 
the one side or the other has won 2m. Once more his chance of escaping 
ruin is 3. Keeping on thus we see that if the player undertake to play an 
indefinite number of games, the chance that he will never be ruined is 
4x 4X 4.---; indefinitely small. 

If the player and banker have each a finite sum, the maximum stake is 
surely less than their combined fortunes, and we are under our second case. 


2. Limited Stakes. Let us assume that the player has a fortune m 
and that he undertakes to play until either he losses the whole of this, or wins 
the sum » from the banker. This is the usual state of affairs in public gam- 
bling, although some players being with a very indefinite notion of the value 
of n, which has the effect of making n very large, thereby greatly increasing 
the probability that the player will be ruined. When the player has won the 
sum n we shall, for convenience, say that the bank has been ruined, though 
the player of ordinary means who should really set n at the amount of the 
bank’s funds, would do a very foolish thing. For the sake of simplicity we 
shall assume that if the player is so nearly ruined that he cannot afford to 
stake as much as he wishes, then he stakes all that he has, whereas if his win- 
nings are so near to n that the residue is less than the banker should put up 
in order to cover the player’s usual stake, then also we shall assume that the 
player reduces his stake accordingly. Asa matter of fact it does not make 
any great difference how we treat these extreme circumstances ; unless m + n 
is a comparatively small multiple of the amount of the stake, or unless the 
game is grossly unfavorable to one side; when either of these cases arises, the 
ruin of the less favored party is almost certain. 

What do we mean by the game’s being favorable or unfavorable to one 
side ? 


ler 


If a man have mutually exclusive chances p, p, - - - p, where > P=! 
i 
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of obtaining the positive or negative values a, a, --- a, respectively, then the 


expression 
i=r 
> Pi% 
i=l 


is defined as his expectation. An individual play, or a whole game, is said 
to be fuir when the expectation of each person before entering is zero. Thus, 
let the player stake a with the probability p of winning, while the banker, 
whose favorable chance is ¢ = 1 — p, replies with a stake b. The player's ex- 
pectation is pb — ga, while that of the banker is ya — pb. The criterion for a 
fair play is pb — ga = 0, 


or —-=-, (1) 


Ina fair play between two persons, the stake of each is proportional to the 
probability that he will win. 

This rule is easily extended to any number of persons. 

Suppose, next, that we have a succession of two fair plays, the player's 
and banker’s chances and stakes being respectively, a), p,;; 4, 4 in the first, 
(tz, Pz; 5g, Yq in the second. The playei’s expectation from the game com- 
posed of these two is 


Pr P2(O + Og) + Pr 92(, — 2) + Pan(e — |) — N92(U + &) 
= (Pi + 1) (P26: — 92%) + (P2 + 2) (PA -— 1%) = 9. 


This reasoning may be continued indefinitely, whence 

Any succession of fair plays will constitute a fair game. 

The commonest games are those where p and ¢ retain constant values ; 
let us show that if such a fair game be continued indefinitely the ruin of either 
player or banker is certain. The player's fortune is set at m, the banker's at 
n. We first suppose that the stakes are constant, « and 6. If v turns be 
played, the probability that the player shall win not more than vp + & nor 
less than vp — & is, by Bernouilli’s theorem, 


k 


¢ jm 
= [eee 
\ 7 Jo 
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and for any constant value of /, this will approach zero as a limit as v in- 
creases indefinitely. Replacing / by «, which is supposed to be either posi- 
tive or negative, if the player win vp + « turns his winnings will be 


(vp +«) b— (vg —«) @=K(a+ 6). 


Take « numerically larger than the langer of the two quantities m/(a + 6), 
n/(a +), and allow the play to continue indefinitely ; the ruin of the one or 
other party is seen to be certain. 

The case where the player may alter his stake at will requires more deli- 
cate handling. Let us suppose that in v turns the player makes the stake 
a, 7, times, in general the stake, a;, 7; times ; the banker’s corresponding stakes 
being 6,, b,, - + + b,. 


r 


We have 


M 
f 


1 


The player's chance of winning a particular play is independent of the amount 
staked, so that of the xp + « plays which he wins, the most probable number 


. ; ’ rj 
to be associated with the stake, a;, will be (vp + «) —. Letus assume, there- 
v 


, rj — , 
fore that, of the a; play he wins r;p + <i e;, His total winnings will be 
> 


i=r ‘a ixr 
> [ (2 Es =) r; + « |b — Ss | (v a ") % ~ a 
Bd fa D4 
i=l i=l 
i=r 
—~ /K 
= z=. (< vs - «) (« a+ Ms) . 
i=1 ' 
Let us modify this slightly. Since xp + « turns are won, 
= 0 
i=l 


If, then a, be the smallest stake, we may write the player's winnings as 


i=r ; r 


L i | : 
" yy rj(a; + 4) ‘, e[ (4, — 4) + (6, - b,)}. 


i=1 i=‘ 
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The quantity « is, by Bernouilli’s theorem, equally likely to be positive or 
negative. If, after v, turns, it be found to be positive, after v. turns more it 
will have a greater chance of being positive than negative, but this dispro- 
portion approaches zero as a limit, as v, increases indefinitely, and may be 
disregarded when v, is very large. The quantities ¢, ---«¢, may be positive 
or negative, and may all have like signs, or opposite signs in varying propor- 
tions. Let us assume, hence, that / represents the probability that after »; 
turns, the sign of ¢; shall be that of « ; the probability that in ¢ series of 1, vp. 
v, turns, each, the ¢;’s shall never after v; turns, all have the same sign as « is 


j=t ‘=r ; 
Wia- We), 
j=l t=2 


and this quantity approaches zero as a limit as ¢ increases indefinitely. But 
when all the e’s but ¢, have the same sign as «, the ruin of the one party or 
the other is even more certain than in the case of constant stakes. 

The ruin of one party or the other being thus assured, we may easily cal- 
culate the probability favorable to each. Let P be the probability that the 
player shall ruin the banker, and Y = 1 — /’ the probability that the banker 
shall ruin the player. As the game is fair, the expectation of each is zero, 
hence 


m a. ae (2) 


m+n’ m+n 


Pra—Qm=0, P= 





It should be noticed that these values are independent of the amounts staked. 
We must now pass to the case of a game which is unfavorable to the 


player. Here 
7 pb—qa< 0. (3) 


a and b shall be constant stakes, except in the extreme cases alluded to, where 
the one side or the other could not afford such a stake. Let us write the 
equation 

S(x) = part? —at+q=0, 


J’ (x) = p(a + b)att?—? — art—!. 
The roots of /’(x) are 


b a 


NV p(ast)’ 


x=Q0Q, 
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so that there is but one positive root. f(x) hasa root 1. Moreover 


f'(1) =p(at+ 6) -—a=pbh—q<9, 
I(x =+O, 


so that f(x) has just one positive root, greater than unity. Call this a. 
Then 


patt?—at+q7=0, a>l. (4) 


We now make use of an ingenious device due to DeMoivre,* which consists in 
assuming that the player's fortune is composed of m counters whose values 
are a, a*, --- a™ while the banker has n counters a™+!, a™+?, .--a™*+", The 
player shall stake his a highest counters each time, and the banker his 4 lowest, 
so that when the lowest counter staked is a? the player’s expectation from the 
play is 


p(atte 4+ qttatl... gteti=t) — q(a™ + att? 14.4 atte—1) 


a 


 [p(att® — at) — 4(a* — 1)] 





a 





a* 
oe ane pert? a +g) a O. 
The game has thus been transformed into a fair one, the ruin of the one or 
the other is certain, and the probability favorable to each is equal to his for- 
tune divided by the sum of the two fortunes. The player's chance is, thus, 








i=m 
a' 
i=l a" — 1 
j=m+ n — a” +n l . (5) 
a‘ 
{=1 
The banker’s will be 
a™+ ni @qm 
Ps (6) 


It seems to have escaped the notice of previous writers upon this subject that 
unless a and 4 are equal, and divide evenly into m and n, the Values of a 








* The Doctrine of Chances, London, 1756, p. 52. 





































1909] THE GAMBLER’S RUIN 187 


or m + n must be susceptible of modification in this formula, in order that it 
cover the case when one side or the other can not put up the requisite stake. 
This fact is theoretically important, but practically insignificant, as we have 
already pointed out. When the player's fortune has sunk to ¢ < a, the chance 
favorable to him is certainly less than e/(m + n), a trivial fraction, when n is 
large compared with a. On the other hand, when the banker's fortune is 
reduced to » < 6, if the game be nearly fair, his chance is but little over 
n/(m+n). 

It is now time to take up the elusive case where the player may alter his 
stake at will. Assuming that a is the largest allowable stake, let us imagine that 
the player leads off with astake «’ less than a, but constantly plays a thereafter. 
IIas he improved or injured his chance’ The latter is, in the present case, 
made up of two parts, according as he wins or loses the first play, so that it 


has the value 
a™+o'_ |} a®-* — } 


P owen y+’ Gate] (7) 


Subtracting this expression from (5) we get 





a™ — pam t+” om qa™—* a™ Pre “ 
— gmtn _ 2) = av (amta_]) (Ps re +6 —4q)- 





If we write a’ = ap, b' = ph, p < 1 we see a’ = a; is a root of the equation 
pat’ —2*4+qH=0, 

and 
l<a< as? 


part’ —a%4+q< 0. 


The difference between (7) and (5) is positive, or the probability favorable to 
the player has been lessened. 

There is just one troublesome case where this reasoning is inapplicable, 
namely, where, after losing a’ the player would have to reduce his stake, thus 
altering a in the second term of (7). 

The player’s stake, in case he lose the first play, shall now be 





a 
m—a’=-— o>l 
o 


We must consequently increase a to a’. 
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The player's chance thus becomes 


am™th’ _ 1 a7m—a')_] 
P mtn yt? Gwtm (8) 


It is necessary to compare the second term of (8) with that of (7). To do 


so let us note the alteration in the former as o increases from 1. Expanding 
by Maclaurin’s theorem we have 


1 a 
gine) 5 [ (m — a’) log ale + 51] (m—- a’) log «| oF 4... 


q7(mtn) a= aaa 1 r 2 
[ «m + n)loga |o + si] Om + n) loge | * oe 





Let us write this 





_—" Cc’ , C ‘ 
f(a) _ "er ** aia 
(a) aN ota es 
2! 3: 
2 _ on \.3 
f(a) _ 2 _F- dite atte 3 (¢- - pre 
5 $(c)  (ce4 F ao + a a 


This expression is negative for o > 1, hence (8) is less than (7), and, a 
fortiori less than (5). 

We may pursue this line of reasoning further. As it was a mistake for 
the player to replace his first stake by another smaller stake, so it will be a 
further error for him to replace a in his second stake by a smaller value, and 
soon. We have already seen that any series of stakes whatsoever is sure to 
end in the ruin of the one party or the other, and we now see that no other 
series gives the player such a good chance as he gets by playing the maximum 
each time. We thus reach our fundamental theorem. 

The player's best chance of winning a certain sum at a disadvantageous 
game, is to stake the sum which will bring him that return in one play, or, if 
that be not allowed, to make always the largest stake which the banker will 
accept. 
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The principal deduction from this is of a moral nature. The average 
gambler will say “The player who stakes his whole fortune on a single play is 
a fool, and the science of mathematics can not prove him to be otherwise.” 
The reply is obvious: “The science of mathematics never attempts the im- 
possible, it merely shows that other players are greater fools.” 

Let us make some simple deductions from our formula (5). The 
player’s chance is always less than m/(m +n). A better approximation, when 
mand n are both small is 


: m+ 4 m(m—1)(a—1) 
(m+n) +4(m+n)(m+n—1 )(a—1)’ 





When m and n are equal the player's chance is 


; (9) 





a™ + | 
If the player should become frightened, and reduce his stake to a his chance 
Cc 


would be reduced to 
1 


a™ 4 1° 


When a and 4 are equal 
é= 7 (10) 


When m is decidedly large, the probability that the banker will be ruined is 


close to 


l 
= (11) 
3. The Game of Roulette. Asanapplication of the foregoing prin- 
ciples, I am going to consider the game of roulette, as played at Monte Carlo. 
This game has two advantages, the honesty of the banker is above suspicion, 
and the player’s disadvantage is less than in any other form of gambling with 
which I am familiar. The following description is taken from Sir Hiram 


Maxim.* 
“The roulette consists of a large circular basin, about two feet in diameter, 


withthe outer rimturnedinward. The bottom of the basin, whichforms the wheel, 





* Monte Carlo, Fucts and Fancies, Loudon, 1906, pp. 257 ff. 
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is of metal quite separate from the rim or sides, and is nicely balanced and 
mounted on a fine pivot so that when set in motion, it will spin for a consider- 
able time. The outer edge of the wheel is accurately divided into thirty-seven 
sections of pockets, eighteen of which are painted red and eighteen black. One 
is called zero, and is neutral in color. The pockets are numbered from one to 
thirty-six.” The wheel is set in motion, and a small ball started rolling around 
the edge in the opposite direction. The game consists in betting on the 
number or color of the division into which the ball will eventually roll. 
There are fourteen different methods of staking. The simplest are, red or 
black, even or odd, above or below eighteen. In each of these cases the 
player and banker stake equal sums. Ifa player stake upon a number, 
the banker puts up thirty-five times the amount. The upper limit for a simple 
stake, say red or black, is twelve hundred dollars; a player may not stake 
more than thirty-six dollars on a number. When the ball falls into zero, 
the player on a simple chance may either forfeit one half of his stake or leave 
the whole “in prison” till the next turn. If he be fortunate at this turn he 
saves his stake but gets nothing further, if he lose, the stake is gone forever. 
On the other hand all who have staked on particular numbers or combinations 
thereof lose their stakes at once on the arrival of zero. 

What are the probabilities of success for a player staking on a simple 
chance as red or black? We may imagine that he plays twelve hundred dollars 
each time, and this sum we shall call one, so that m and x represent the 
player’s and the banker’s fortunes respectively, expressed as multiples of 
twelve hundred. In seventy-four turns the player may expect on an average, 
thirty-six reds, thirty-six blacks, one zero followed by a loss, and one zero 
followed by a gain, which leaves matters just as before and may be counted 
out. We have then 


536 537 

y = ==, Y=ssa9 

I 73 / 73 
a=b=1, pee, 
36 


The player’s probability of winning is 
37\™ 
(55 = 


Bintan 
(5 ads 
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When m and n are equal this becomes 


| 
[31\"_ 
1 
Ga) + 
Let us next suppose that the player stakes upon a number. The amount 
which may be staked is, here, about one thirty-third of what it was before, so 
that the game is the same as if the stake remained constant, and the for- 


tunes of both banker and player were increased thirty-three fold. The prob- 
ability for the player will be the same as before if @ be equal to the thirty- 


: 37 ; ; ; ; : 
third root of (3) and will be less if a be larger. Now the thirty-third 
’ 


37. 
root of = is 1.00085. 
ot 
Tr ¢ 1 36 : 
lo find a, we have p = 37° 1 = 3 a=1l1b=35,a=l1+e, 
od ot 


(1 + €)%— 37(1 + €) + 3H = 0, 
(1 + 56 € + 630 &) — 37 — 387 € + BH = 0, 


651 


= — = 1.00 4 
630 . - 


l+e 


This method is, therefore, somewhat less favorable to the player than the other. 

A third simple game is played by staking on all the numbers but one. 
The maximum allowable here is thirty-five times thirty-six dollars, not 
far from twelve hundred, so that we may call the stake one as before. Each 
time the player wins at this game he sacrifices thirty-four thirty-fifths of his 
stake, so that 4 is one thirty-fifth. We shall further have 





whereas in the simple game 
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which shows that staking on thirty-five numbers would be an unwise method 
of play. 
When n is large, the chance of ruin for the bank is 


l 
a" 

We have no accurate data as to the amount of ~, but apparently as_ the 
bank's earnings in a year are between five and six million dollars* it is not 
unreasonable to suppose that the sum of one million two hundred thousand 
dollars might be produced in a very short time. The chance of ruin on red 
or black is therefore not above 


36 1000. 1 
7) ™~ 7OPISTOOOOO00 - 


I conclude with two more quotations from Maxim’s book. The first is 
from a letter from Baron Czyllak, quoted on p. 172. 

“The calculation of probabilities by means of which inventors of systems 
seek to master roulette, and also the theory of roulette, have for their author 
the same savant that is to say, the celebrated Pascal.” So does posterity treat 
the great theologian whose treatise on cycloids is called “Histoire de la 
roulette !” 

The next quotation is from Maxim himself, on p. 190, and gives the best 
explanation which I have seen for the fact the people continue to gamble : 

“Je me rends parfaitement compte du désagreable effet que produit sur 
la majorité de Vhumanité, tout ce qui se rapporte, méme au plus faible dégré, 
& des calculs ou raisonnements mathématiques.” 


CAMBRIDGE, Mass., 
Marcn, 1909. 





* Maxim, loc. ‘cit. p. 9. 
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